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0 NANBV diagnostics and vaccines. 

@ A new virus. Hepatitis C vims (HCV). which has proven to be the major etiologic agent of blood-borne 
NAN6H. was discovered by Applicant The initial work on this virus, which Includes a partial genonnic sequence 
of the prototype HCV Isolate, is described in EPO Pub. No. 318.216, and PCT Pub. No. WO/89/04669»»The 
present invention^ which in part is based on new HCV sequences and polypeptides which are not disclosed in 
the above-crted publications. Includes the application of these new sequences and polypeptides in immunoas- 
says, probe diagnostics. anti-HCV antibody production, PGR technology, and recombinant DNA technology. 
Included within the invention also are novel Immunogenic polypeptides encoded within clones containing HCV 
cDNA, novel mett^s for purifying an immuncgenic HCV polypeptide, and antisense polynucleotides derived 
from HCVcDNAT* 
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NANBV DIAGNOSTICS AND VACONES 



Technical Reid 

The invention relates to materials and methodologies for managing the spread of non-A, non-B hepatitis 
virus (NANBV) infection. More specifically, it relates to polynucleotides derived from the genome of an 
5 etiologic agent of NANBH. tiepatitis C virus (HCV). to polypeptides encoded therein, and to antibodies 
directed to the polypeptides. These reagents are useful as screening agents for HCV and Its infection, and 
as protective agents against the disease. 
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U.S. Patent No. 4.493.890 



Background Art 

s 

Non-A, Non-8 hepatitis (NANBH) Is a transmissible disease or family of diseases that are believed to be 
viral-induced, and that are distinguishable from other fomis of viral-associated liver diseases. Including thai 
caused by the known hepatitis viruses, i.e.. hepatitis A virus (HAV). hepatitis B virus (HBV), and delta 
hepatitis virus (HOV), as well as the hepatitis induced by cytomegalovirus (CMV) or Epsteln-Barr virus 
10 (EBV). NANBH was first identified in transfused individuals. Transmission from man to chimpanzee and 
serial passage in chimpanzees provided evidence that NANBH is due to a transmissible infectious agent a 
agents. 

Epidemiologic evidence is suggestive that there may be three types of NANBH: the water-borrw 
epidemic type; the blood or needle associated type; and the sporadically occurring (community acquiretl 

rs type. However, the number of agents which may be the causative of NANBH are unknown. 

Clinical diagnosis and Identification of NANBH has been accomplished primarily by exclusion of ottia 
viral markers. Among the methods used to detect putative NANBV antigens and antibodies are agayr-gd 
diffusion, counterimmunoeleclrophoresis, immunofluorescence microscopy, immune electron microscopy 
radioimmunoassay, and enzyme-linked immunosorbent assay. However, none of these assays has proved 

20 to be sufficiently sensitive, specific, and reproducible to be used as a diagnostic test for NANBH i 

Previously there was neither clarity nor agreement as to the identity or specificity of the antigei 
antibody systems associated with agents of NANBH. This was due, at least in part, to the prior or cq 
infection of HBV with NANBV in individuals, and to the known complexity of the soluble and particular 
antigens associated with HBV, as well as to the integration of HBV DNA into the genome of liver cells. Ii 

25 addition, there is the possibility that NANBH is caused by more than one infectious agent, as well as tt« 
possibility that NANBH has been misdiagnosed. Moreover, it is unclear what the serological assays deted 
in the semm of patients with NANBH. It has been postulated that the agar-gel diffusion and counlerim 
munoelectrophoresis assays detect autoimmune responses or nonspecific protein interactions that some 
times occur between semm specimens, and that they do not represent specific NANBV antigen-antibodi 

30 reactions. The immunofluorescence, and enzyme-linked immunosorbent and radioimmunoassays appear H 
detect low levels of a rheumatoid-factor-like material that is frequently present in the serum of patients witt 
NANBH as well as in patients with other hepatic and nonhepatic diseases. Some of the reactivity delectel 
may represent antibody to host-determined cytoplasmic antigens. 

There have been a number of candidate NANBV. See, for example the reviews by Prince {1983J 

35 Feinstone and Hoofnagle (1984). and Overby (1985, 1986. 1987) and the article by Iwarson {1987J 
However, there is no proof that any of these candidates represent the etiological agent of NANBH. 

The demand for sensitive, specific methods for screening and identifying carriers of NANBV ani 
NANBV contaminated blood or blood products is significant. Post-transfusion hepatitis (PTH) occurs i 
approximately 10% of transfused patients, and NANBH accounts for up to 90% of these cases. Themaja 

40 problem in this disease is the frequent progression to chronic liver damage (25-55%). 

Patient care as well as the prevention of transmission of NANBH by blood and blood products or d 
close personal contact require reliable screening, diagnostic and prognostic tools to detect nucleic acidi 
antigens and antibodies related to NANBV. In addition, there is also a need for effective vaccines art 
immunotherapeutic therapeutic agents for the prevention and/or treatment of the disease. 

45 Applicant discovered a new virus, the Hepatitis C virus (HCV). which has proven to be tfie ms«o 
etiologic agent of blood-bome NANBH (BB-NANBH). Applicant's initial work, including a partial genomi 
sequence of the prototype^ HCV isolate, CDC/HCVI (also called HCV1), is described in EPO Pub. ^a 
318.216 (published 31 May 1989) and PCT Pub, No. WO 89/04669,^(pu&Jj;§b-e^ 1 June 1989). Ih 
disclosures of these patent applications, as well as*" any corresponding national patent applications, ai 

50 incorporated herein by refereqcf^^These applications^teach. inter alia, reconnbinant DNA methods of cbniif 
and expressing HCV sequwices. HCV polypeptides, HCV immunodiaghostic techniques. HCV proh 
diagnostic techniques, afiti-HCV antibodies, ; and methods of isolating new hCV sequences, indudin 
sequences of new HCV isolates. - 



55 



Disclosure of ttj© lljygQSgJj^^^^ 

The present invention, is based^ and polypeptides that are hot disclosd , 
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in EPO Pub. No. 318.216, or in PCT Pub. No. WO 89/04669. Included within tte invention Is the application 
of these new sequences and polypeptides In. inter alia, immunodiagnostics. probe diagnostics. anti-HCV 
antibody production, PGR technology and recombinant DNA technology. Inciuted within the invention, also, 
are new Immunoassays based upon the immunogenicrty of HCV polypeptide disclosed herein. The new 
5 subject matter claimed herrein. while developed using techniques described ii\ for example. EPO Pub. No. 
318.216, has a priority date which antecedes that publication, or any counterpat thereof. Thus, the invention 
provides novel compositions and methods useful for screening samples for ICV antigens and antibodies, 
and useful for treatment of HCV infections. 

Accordingly, one aspect of the invention is a recombinant polyriucledde comprising a sequence 
to derived from HCV cDNA. wherein the HCV cDNA is in clone 13i. or clone 26j, or clone 58a. or clone 84a, or 
clone CA156e. or clone 167b. or clone pi 14a. or clone CA216a, or clone GA2IOa, or done agSOa. or clone 
205a. or clone 18g. or done 16jh. or wherein the HCV cDNA is of a sequence indicated by nucleotide 
numbers -319 to 1348 or 8659 to 8866 in Rg. 17. 

Another aspect of the invention is a purified polypeptide comprising an epitope encoded within HCV 
ts cDNA wherein the HCV cDNA is of a sequence indicated by nucleotide numters -319 to 1348 or 8659 to 
8866 in Rg. 17. , • *- 

Yet another aspect of the invention is an immunogenic polypeptide produced by a cell transformed with 
a recombinanft expression vector comprising an ORF of DNA derived fronn HCV cDNA. wherein the HCV 
cDNA is comprised of a sequence derived from the HCV cDNA sequence in clone CAZTBa, or clone CA74a. 
20 or clone 13i. or clone CA290a, or clone 33C or clone 40b. or clone 33b. or clo» 25c, or clone 14c. or clone 
8f, or clone 33f, or clone 33g. or clone 39c. or clone 15e. and wherein ttieORF is operably linked to a 
control sequence compatible with a desired host. 

Another aspect of the invention is a peptide comprising an HCV epitope, wherein the peptide is of the 
formula 
25 AAx-AAy. 

wherein x and y designate amino acid numbers shown in Rg. 17, and wherein the peptide is selected from 
the group consisting of AA1-AA25. AA1-AA50. AA1-AA84. AA9-AAt77. AA1-/M0. A^AA20, AA2a-AA25, 
AA35-AA45, AA50-AA100. AA40-AA90. AA45-AA65. AA65-AA75. AA80-9a AA9&-AA120/ AA95-AA110. 
AA105-AA120. AA100-AA150, AA150-AA200. AA155-AA170, AA190-AA210. ilA2(K>-AA250. AA220-AA240. 

30 AA245-AA265. AA250-AA300. AA290-AA330. AA290-305, AA300-AA350. >V^1(KAA330, AA350-AA400, 
AA380-AA395. AA405-AA495. AA400-AA450. AA405-AA415. AA415-AA425. IA4&AA435. AA437-AA582, 
AA450-AA500. AA440-AA460, AA460-AA470. AA475-AA495. AA500-AA550. jlA511-AA690. AA515-AA550. 
AA550-AA600. AA550-AA625, AA575-AA605. AA585-AA600, AA60O-AAe5O. jlA60(WVA625. AA635-AA665. 
AA650-AA700. AA645-AA680. AA700-AA750. AA700-AA725. AA700-AA750. jiA725-AA775. AA770-AA790. 

35 AA750-AA800. AA800-AAai5, AA825-AAa50, AA850-AA875. AA800-AA850. iiA9a[l-AA990. AA850-AA900. 
AA920-AA945. AA940-AA965. AA970-AA990. AA950-AA1000. AA1 000-AA1 OHI. AA1000-AA1025. AA1000- 
AA1050. AA1025-AA1040, AA1 040-AA1 055. AA1 075-AA1 1 75. AA1 050-AA1 200. AA1070-AA1100. AA1100- 
AA1130. AA1140-AA1165. AA1 192-AA1457. AA1195-AA1250, AA1 200-AA1 225. AA1225-AA1250, AA1250- 
AA1300. AA1260-AA1310, AA1260-AA1280. AA1266-AA1428. AA1 300-AA1 331. AA1290-AA1310. AA1310- 

40 AA1340, AA1345-AA1405, AA1345-AA1365. AA1350-AA1400. AA1365-AA13S. AA1380-AA1405. AA1400- 
AA1450. AA1450-AA1500. AA1460-AA1475. AA1475-AA1515. AA1 475-AA1 50). AA1500-AA1550, AA1500" 
AA1515. AA1515-AA1550. AA1 550-AA1 600. AA1545-AA1560, AA1569-AA193. AA1570-AA1590. AA1595- 
AA1610. AA1590-AA1650. AA1610-AA1645. AA1650-AA1690. AA1 685-AA1 771. AA1689-AA1805, AA1690- 
AA1720. AA1694-AA1735. AA1720-AA1745. AA1745-AA1770. AA1 750-AA1 80). AA1775-AA1810. AA1795- 

45 AA1850. AA1850-AA1900, AA1900-AA1950. AA1900-AA1920. AA1916-AA202I. AA1920-AA1940. AA1949- 
AA2124, AA1950-AA2000, AA1 950-AA1 985, AA1980-AA2000. AA2000-AA20a), AA2005=AA2025, AA2020- 
AA2045. AA2045-AA2100. AA2045-AA2070. AA2054-AA2223. AA2070-AA21(D. AA2t00-AA2150. AA2150- 
AA2200. AA2200rAA2250, v AA22p0-AA2325. AA2250r^2330. AA2255-AA227II. AA2265-AA2280, AA228d- 
AA2290. AA2287.AA2385. AA23dti^AA2350. AA2290-AA2310, AA2310-AA23a). AA2330-AA2350, AA2350- 

50 AA2400. AA2348-AA2464. AA2345-AA2415. AA2345-AA2375, AA2370-AA24D. AA2371-AA2502, AA2400!- 
AA2450, AA2400'AA2425, AA2415-AA2450. AA2445-AA2500, AA2445-AA2475. AA2470-AA2480, AA25G0- 
AA2550, AA2505-AA2540, AA2535-AA25B0. AA2550-AA2600. AA2560-AA25ai. AA2600-AA2650, AA26D5- 
AA2620, AA2620-AA2650, AA2640-AA2660. AA265D-AA2700. AA2655-AA267). AA2670-AA2700. AA2700- 
AA2750. vAA2740tAA2760, AA2750-AA2800, AA2755-AA2780. AA2780-AA28a). AA2785-AA2ai0. AA2796- 

55 AA2886; AA2810-AA2i825, AA2800-AA2a50. AA2850-AA2900, AA2850-AA2885. AA2885-AA2905. AA2900- 

■ " 'StiljPM a rifionoclonal antibody directed sjfalnst an epitope encoded In 

HCV c6fiJA;^whe^^ is of a sequence indicated by nucleotfe rwrnbere -319 to 1348 or 
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8659 to 8866 in Fig. 17. or is the sequence present in clone 13i, or clone 26j, or clone 59a. or clone 84a. or 
clone CA156e, or clone 167b. or clone pi 14a, or clone CA216a. or clone CA290a, or clone ag30a. or clone 
205a. or clone I8g. or clone 16jh. 

Yet another aspect of the invention is a preparation of purified polyclonal antibodies directed against a 

5 polypeptide connprised of an epitope encoded within HCV cDNA, wherein the HCV cDNA Is of a sequence 
Indicated by nucleotide numbers -319 to 1348 or 8659 to 8866 in Rg. 17, or is the sequence present in in 
clone 13i. or clone 26j. or clone 59a. or clone 84a, or clone CA156e. or clone 167b, or clone p!14a. or clone 
CA216a. or clone CA290a. or clone agSOa. or clone 205a. or clone 18g, or clone 16jh. 

Still another aspect of the invention is a polynucleotide probe for HCV. wherein the probe is comprised 

10 of an HCV sequence derived fronn an HCVcDNA sequence indicated by nucleotide numbers -319 to 1348 
or 8659 to 8866 in Rg. 17, or from the complement of the HCV cDNA sequence. 

Yet another aspect of the invention is a kit for analyzing samples for tiie presence of polynucleotides 
from HCV comprising a polynucleotide probe containing a nucleotide sequence of about 8 or more 
nucleotides, wherein the nucleotide sequence is derived from HCV cDNA which is of a sequence indicated 

/5 by nucleotide numbers -319 to 1348 or 8659 to 8866 in Rg. 17. wherein the polynucleotide probe is In a 
suitable container. 

Another aspect of the invention is a kit for analyzing samples for the presence of an HCV antigen 
comprising an antibody which reacts Immunologically with an HCV affrtigen. wherein the antigen contains an 
epitope encoded within HCV cDNA which is of a sequence indicated by nucleotide numbers -319 to 1348 or . 
20 8659 to 8866 in Rg. 17. or wherein the HCV cDNA is in clone 131. or done 26j, or clone 59a. or clone 84a. . ^ 
or clone CA156e, or clone I67b. or clone pi14a. or done CA216a. or clone CA290a. or clone ag30a. or 
clone 205a, or clone 18g, or clone 16ih. 

Yet anotiier aspect of the invention is a kit for analyzing samples for the presence of an HCV antibody 
comprising an antigenic polypeptide containing an HCV epitope encoded witiiin HCV cDNA which is of a 
25 sequence indicated by nucleotide numbers -319 to 1348 or 8859 to 8866 in Rg, 17, or is in ckine 13i. or 
clone 26i. or clone 59a. or clone 84a. or clone CA156e, or clone 167b. or clone pi14a, or clone CA2l6a. or 
clone CA290a. or clone ag30a. or clone 205a, or clone 18g, or done 16jh. 

Another aspect of the invention Is a kit for analyzing samples for the presence of an HCV antibody . 
comprising an antigenic polypeptide expressed from HCV cONA in clone CA279a. or clone CA74a. or clone 
30 13i. or clone CA290a. or clone 33C or clone 40b. or clone 33b, or done 25c, or clone 14c, or clone 8f, or 
clone 33f, or clone 33g, or clone 39c. or clone 15e, wherein the antigenic polypeptide is present in a 
suitable container. 

Still anotiier aspect of the invention is a method for detecting HCV nucleic acids in a sample 
comprising: 

35 (a) reacting nucleic acids of the sample with a polynucleotide probe for HCV, wherein the probe is 
comprised of an HCV sequence derived from an HCV cDNA sequence is of a sequence indicated by 
nucleotide numbers -319 to 1348 or 8659 to 8866 in Rg. 17. and wherein the reacting is under conditions 
which allow the formation of a polynucleotide duplex between the probe and tiie HCV nucleic acid from the 
sample; and (b) detecting a polynucleotide duplex which contains the probe, formed in step (a). 

40 Yet another aspect of the invention Is an immunoassay for detecting an HCV antigen comprising: 

(a) incubating a sample suspected of containing an HCV antigen with an antibody directed against an HCV 
epitope encoded in HCV cDNA. wherein the HCV cDNA is of a sequerice indicated by nucleotide numbers 
-319 to 1348 or 8659 to 8866 in Rg. 17. or is the sequence present in clone 13i. or clone 26i. or clone 59a, 
or clone 84a, or clone CAl56e, or clone 167b. or clone pi14a. or done CA216a. or done CA290a, or clone 

45 ag30a, or clone 205a. or clone 18g. or done 16jh. and wherein the incubating is under conditions which 
allow formation of an antigen-antibody complex; and (b) detecting an anti'body-antigen complex formed in 
step (a) which contains the antibody. 

Stiil another aspect of the Invention is. an immunoassay for detecting antibodies directed against an 
HCV antigen comprising: 

50 (a) incubating a sample suspected of containing anti-HCV antibodies with an antigen polypeptide containing 
an epitope encoded in HCV cDNA, wherein the HCV cDNA is of a sequence indicated by nucleotide 
numbers -319 to 1348 or 8659 to 8866 In Rg. 17. or is tiie sequence present in clone 131. or clone 26j, or - 
clone 59a, or clone 84a. or clone CAl56e. or clone 167b, or done pi14a, or clone CA2 16a, or clone 
V, . CA290a, or clone agSOa, or clone 205a. or clone 18g, or clone 16ih. and wherein the incubating . IS under : 
' ' \55 conditions' which allow formation of an antigen-antibody complex; and detecting an antibody-antigen 

. the invention is a vaccine for treatment of HCV infection comprising an immunogenic^ ^^^^^ 

' ^^^' r f^^Z^^ an HCV epitope encoded in HCV cDNA. wherein the HCV cDNA is of, a sequence " v f 
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indicated by nucleotide numbers -319 to 1348 or 8659 to 8866 in Rg. 17 or is the sequence present in 
clone 131, or clone 26j, or clone 59a. or clone 84a, or done CA156e. or clone 167b, or clone pi14a, or done 
CA216a. or done CA290a, or clone agSOa, or clone 205a, or clone 18g. or clone 16jii, and wherein the 
immunogenic polypeptide is present in a pharmacologically effective dose in a pharmaceutically acceptable 
5 excipient 

Yet another aspect of the invention is a method for producing antibodies to HCV comprising administer- 
ing to an individual an isolated immunogenic polyeptlde containing an HCV epitope encoded in HCV cDNA. 
wherein the HCV cDNA is of a sequence indicated by nucleotide numbers -319 to 1348 or 8659 to 8866 in 
Rg. 17, or is of the sequence present in clone CA279a. or clone CA74a, or done 13i, or clone CA290a, or 

10 clone 33C or clone 40b. or clone 33b, or clone 25c or done 14c, or clone 8f. or clone 33f, or clone 33g, or 
clone 39c. or clone 15e, and wherein the immunogenic polypeptide is present in a pharmacologically 
effective dose in a phamnaceutically acceptable excipi^ 

Stiii another aspect of the invention is an antisense polynucleotide derived from HCV cDNA, wherein the 
HCV cDNA is that shown in Rg. 17. 

15 Yet another aspect of the invention is a method for preparing purified fusion polypeptide 01 00-3 
comprising: 

(a) providing a cnide ceillysate containing polypeptide CI 00-3, 

(b) treating the crude cell lysate with an amount of acetone which causes the polypeptide to 
precipitate, g and solubilizing the precipitated 

20 (c) isolatin material, " 

(d) isolating the CI 00-3 polypeptide by anion exdiange chromatography, and 

(e) further isolating the CI 00-3 polypeptide of step (d) by gel filtration. 

25 Brief Description of the Drawings 

Rg. 1 shows the sequence of the HCV cDNA in done I2f. and the amino acids encoded herein. 

Rg. 2 shows the HCV cDNA sequence in clone k9-1, and the amino acids encoded therein. 

Rg. 3 shows the sequence of clone 15e, and the amino acids encoded therein. 
30 Rg. 4 shows the nucleotide sequence of HCV cDNA in clone 13i. the amino acids encoded therein, 

and the sequences which overlap with clone 12f. 

Rg. 5 shows the nucleotide sequence of HCV cDNA in clone 26], the amino acids encoded therein, 
and the sequences which overlap clone 13i. 

Rg. 8 shows the nucleotide sequence of HCV cDNA in clone CA59a, the amino acids encoded 
35 therein, and the sequences which overiap with clones 26i and K9-1 . 

Rg. 7 shows the nucleotide sequence of HCV cDNA in clone CA84a. the amino acids encoded 
therein, and the sequences which overiap with clone CA59a. 

Rg. 8 shows the nucleotide sequence of HCV cDNA in clone CA156e. the amino acids encoded 
therein, and the sequences which overiap with CA84a. 
40 Rg. 9 shows the nucleotide sequence of HCV cDNA in clone CA167b, the amino acids encoded 

therein, and the sequences which overiap CA156e. 

Rg. 10 shows the nucleotide sequence of HCV cDNA in clone CA216a. the amino acids encoded 
therein, and the overiap with clone CA167b. 

Rg. 11 shows the nucleotide^^equence of HCVcDNA in clone CA290a. the amino acids encoded 
45 therein, and the overiap with clone CA21 8a. 

Rg. 12 shows the nucleotide sequence of HCV cDNA in clone ag30a and the overlap with done 
CA290a,„ 

Rg. 13 shows tho nudeotide sequence of HCVcDNA in clone CA205a. and the overiap with, the HCV 
cDNA sequence in clone CA290a 
50 Rg. 14 shows the nucleotide sequence of HCV cDNA in clone 18g. and the overlap with the HCV. 

cDNA sequence in clone ag30a. 

Rg. 15 shows the nucleotide sequence of HCV cDNA in clone 16jh, the amino acids encoded therein, 
and the overlap of nucleotides vwth the HCVcDlslA sequence in clone 15e. 

Rg. 16 shows the ORF of HCV cDNA derived from clones pi14a; CAie7b. CA156e.- CA84a, CA59a. 
K9-1. 12f. 141. lib. 7f. 7e, 8h, 33c, 40b. 37b. 35. 38, 81. 32. 33b. 25c. 14c. 8f, 33f. 33g, 39c, 35f, 19g, 26g, 
andlSe. 

Rg. 17 shows the sense strand of the compiled HCV cDNA , sequence derived* 
described clones and the comf^ted HCV cDNA sequence published In EPO Pub. No. 3^8^^6. T^^^t^e^^ 
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from which the sequence was derived are b114a* 18g. agSOa, GA2053, C;A290a. CA216a. pi14a. CA167b, 
CA156e, CA84a. CA59a. K9-1 (also called k9-1),26j. 131, 12f, 14i, lib. 7f. 7e^ 8h, 33c. 40b. 37b. 35. 36, 81. 
32, 33b, 25c. 14c, Bf, 33f. 33g, 39c, 35f. 19g. 26g. 15e, bSa, and 1C^. In m figure the three horizontal 
dashes above the sequence indicate the position of the putative initiabn' ndnonine codon; the two vertical 
5 dashes indicate the first and last nucleotides of the published sequm:e. Also shown In the figure is the 
amino acid sequence of the putative polyprotein encoded in the HCV cONA. 

Rg. 18 is a diagram of the immunologlcai colony screening m^btxi used In antiganc mapping 
studies. 

Rg. 19 shows the hydrophobidty profiles of polyproteins encoded in HCV and in West Nile virus. 
10 Rg. 20 is a tracing of the hydrophilicity/hydrophobictty profile and of the antigenic index of the 

putative HCV polyprotein. 

Rg. 21 shows the conserved co-linear peptides in HCV and Ravhrffises. 

Modes for Carrying Out the Invention 

75 

L Definitions 

The tenm "hepatitis C virus" tias been reserved by workers in fhe fiebf for an heretofore unknown. 

20 etiologic agent of NANBH. Accordingly, as used herein, "hepatitis 0 vkus' (HCV) refers to an agent 
causitive of NANBH, which was fbnmerly refen^ed to as NANBV and/or ffl-NANBV. The terms HCV. 
NANBV. and BB-NANBV are used interchangeably herein. As an extension ctf (his terminology, the disease 
caused by HCV. formerly called NANB hepatitis (NANBH). is called hepafib C. The terms KANBH and 
hepatitis C may be used interchangeably herein. 

25 The term "HCV". as used herein, denotes a viral species of which pattiogenic strains cause NANBH. 
and attenuated strains or defective interfering particles derived therefrom. As sfwwn infra., the HCV genome 
is comprised of RNA. It is known that RNA containing viruses have relaively high rates of spontaneous 
mutation, i.e.. reportedly on the order of 10"^ to 10~* per incorpprated nucleotide (Relds & Kntpe (1986)). 
Therefore, there are multiple strains, which may be virulent or avorutent mKi the HCV species described 

30 infra. The compositions and methods desaibed herein, enable the propagaion. identification, detection, and 
isolation of the various HCV strains or isolates. Moreover, the disdostare herein allows the preparation of 
diagnostics and vaccines for the vanous strains, as well as compositions and metfiods that have utility in 
screening procedures for anti-viral agents for pharmacologic use, such ^ agents that inhibit replication of 

HCV. ; 

35 The information pro\Hded herein, although derived from the pioto^rpe strain or isolate of HCV. 
hereinafter refenred to as C0C/HCV1 (also called HCV1), is sufficient to alow a viral taxonomist to Identify 
other strains which fall within the species. The information provided tmeM Aiwrs the belief that HCV is a 
Ftavi-iike virus. The morphology and composition of Ravivirus particies are known, and are discussed in 
Brinton (1986). Generally, with reflect to morphology, Raviviruses cotftean a central nucleocapsid sur- 

40 rounded by a lipid bilayer. Virions are spherical and have a diameter of about 40-50 nm. Their cores are 
about 25-30 nm in diameter. Along the outer surface of the virion env^ope ase projections that are about 5- 
10 nm long with terminal knobs about 2 nm in diameter. 

Different strains or isolates of HCV are expected to contain variations at the amino add and nucleic 
acids compared with the prototype isolate, HCV1. Many isolates are eaqpeded to show much (ije. more than 

45 about 40%) homology in the total amino acid sequence cornpared with KCV1. However, it may also be 
found that other less homologous HCV isolates. These wouki be defined as HCV strains according to 
various criteria such as an ORF of approximately 9,000 nucleotides to pnodmately 12.()0d nucleotides, 
encoding a polyprotein simitar in size to that of HCV1, an encoded polypRlUi of sImilar.hydrDphobic and 
antigenic character to that of HCV1, and the presence of oo-ttnear p^tidb sequences that are conserved 

50 with HCV1 i In addition, the genome would be a positive-stranded RNA. 

HCV encodes at least one epitope which is immunologically identifiabte with an epitope in the HCV 
genome from which the cDNAs described herein are derived; preferaUy fie epitope is contained an amino 
acid sequence described herein, TTie epitope is unique to HCV when compared to other impwh Raviviruses. 
The uniqueness of the epitope may be determined by its immunologfeal neactivity with atfiti-H(JV. antibodies^ 

55 and lack of immunological reactivity with antittodles to other FEaviviius species, li/lethods fci'.determining 
immunological reactivity are known in the art. for example, by radloiraffng w ^a 
hemagglutination, and several examples of suitable techniques for assa#^^ 

In addition to the above, the foSowing parameters of nucleic acid homdbgy #d: .^^^^ 
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are applicable, eHher alone or in combination, in identifying a strain or Isolate as HCV. Since HCV strains 
and isolates are evolutionarily related, it is expected that the overall homology of the genomes at the 
nucleotide level probably will be about 40% or greater, probably at)out 60% or greater, and even more 
probably about 80% or greater; and in addition that there will be corresponding contiguous sequences of at 
5 least about 13 nucleotides. The conrespondence between the putative HCV strain genomic sequence and 
the CDC/HCV1 cDNA sequence can be detenmined by techniques known in the art For example, they can 
be detennlned by a direct comparison of the sequence infonmation of the polynucleotide from the putative 
HCV. and the HCV cDNA sequence(s) described herein. For example, also, they can be detenmined by 
hybridization of the polynucleotides under conditions which fomn stable duplexes between homologous 
10 regions (for example, those which would be used prior to Si digestion), followed by digestion with single 
stranded specific nuclease(3). followed by size determination of the digested fragments. 

Because of the evolutionary relationship of the strains or Isolates of HCV, putative HCV strains or 
isolates are identifiable by their homology at the polypeptide level. Generally, HCV strains or isolates are 
expected to be more than about 40% homologous, probably more than about 70% homologous, and even 
15 more probably more than about 80% homologous, and some may even be more than about 90% 
homologous at the polypeptide level. The techniques for determining amino acid sequence homology are 
known in the art For example, the amino acid sequence may be determined directly and compare,d to the 
sequences provided herein. Alternatively the nucleotide sequence of the genomic material of the putative 
HCV may be determined (usually via a cDNA intermediate), the amino acid sequence encoded therein can 
20 be deterrraned, and the corresponding regions compared. 

As used hen^n. a polynucleotide "derived from" a designated sequence refers to a polynucleotide 
sequence which is comprised of a sequence of approximately at least about 6 nucleotides, preferably at 
least about 8 nucleotides, more preferably at least about 10-12 nucleotides, and even more preferably at 
least about 15-20 nucleotides con^sponding to a region of the designated nucleotide sequence, 
25 "Corresponding" means homologous to or complementary to the designated sequence. Preferably, the 
sequence of the region from which the polynucleotide is derived Is homologous to or complementary to a 
sequence whfch is unique to an HCV genome. Whether or not a sequence is unique to the HCV genome 
can be determined by techniques known to those of skill in the art. For example, the sequence can be 
compared to sequences in databanks, e.g.. Genebank. to detenmine whether it is present In the uninfected 
30 host or other organisms. The sequence can also be compared to the known sequences of other viral 
agents, including those which are known to induce hepatitis, e.g.. HAV. HBV, and HDV. and to other 
members of tfie Flaviviridae. The correspondence or non-correspondence of the derived sequence to other 
sequences can also be determined by hybridization under the appropriate stringency conditions. Hybridiza- 
tion techniques for determining the complementarity of nucleic acid sequences are known in the art. and 
35 are discussed infra. See also, for example, Maniatis et al. (1982). In addition, mismatches of duplex 
polynucleotides formed by hybridization can be determined by known techniques, including for example, 
digestion with a nuclease such as SI that specifically digests single-stranded areas in duplex poly- 
nucleotides. Regions from which typical DNA sequences may be "derived" include but are not limited to, 
for example, regwns encoding specific epitopes, as well as non-transcribed and/or non-translated regions. 
40 The derived polynucleotide is not necessarily physically derived from the nucleotide sequence shown, . 
but may be generated in any manner, including for example, chemical synthesis or DNA replication or 
reverse traiscrlption or transcription. In addition, combinations of regions corresponding to that of the 
designated sequence may be modified in ways known in the art to be consistent with an intended use. 
Similarly, a polypeptide or amino acid sequence "derived from" a designated nucleic acid sequence 
45 refers to a polypeptide having an amino acid sequence identical to tiiat of a polypeptide encoded in the 
sequence, or a portion tiiereof wherein the portion consists of at least 3-5 amino acids, and more preferably 
at least 8-10 amino acids, and even more preferably at least 11-15 amino acids, or which is im- 
munologlcaHy identifiable with a polypeptide encoded in the sequence. l,^^ 
A recombinant or derived polypeptide Is not necessarily translated from a designat§cl"nucleic acid 
50 sequence, for example, tiie HCV cDNA sequences described herein, or frorn an HCV genome;,it may be 
generated In any manner, including for example, chemical synthesis, or expression of a recombinant* 
expression system, or isolation from mutated HCV. A recombinant or derived polypeptide may include one 
or more analogs of amino acids or unnatural amino acids in its sequence. Methods of inserting analogs 6t 
amino acids into a sequence are known in the art It also may .include one or more labels, which are known- . 
55 to those of Skill In tiie art ' ■ / - 

The term "recombinant polynucleotide" as Msed he r elnUn te r^ of .genomic. cDNA, 

semisynthetic, or synthetic origin which, by \^rtUe^d)f fe^^^ 
witfi all or a portion of a polynucleotide with which It Is asscK:i^t^^^^^ 
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other than that to which it is linked in nature, or (3) does not occur in nature. 

The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. 
Thus, this temi includes double- and single-stranded DNA. as well as double- and single stranded RNA. It 
5 also includes known types t)f modifications, for example, labels which are known in the art, methylation, 
"caps", substitution of one or more of the naturally occurring nucleotides with an analog, Intemucleotide 
modifications such as. for example, those with uncharged linkages (e.g.. methyl phosphonates. 
phosphotriesters, phosphoamidates. carbamates, etc.) and with charged linkages (e.g., phosphorothioates. 
phosphorodithioates. etc.). those containing pendant moieties, such as, for example proteins (including for 
10 e.g.. nucleases, toxins, antibodies, signal peptides, poly-L-lysine. etc.). those with intercalators (e-g.. 
acridine, psoralen, etc.). those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, 
etc.), those containing atkylators. those with modified linkages (e.g.. alpha anomeric nucleic acids, etc.), as 
well as unmodified forms of the polynucleotide. 

The term ^'PMrifled viral polynucleotide" refers to an HCV genome or fragment thereof which is 
IS essentially „Jfee,T^:. contains less than about 50%, greferably less theulled^out 70%. and even more 
preferably less than about 90% of polypeptides with which the viral polynucledti&e "is fiatil^y associate!. 
Techniques for purifying viral polynucleotides from viral particles are known in the art/ and' include for 
example, dismptlon of the particle with a chaotropic agent, differential extraction and separation of the 
polynucleotide(s) and polypeptides by ion-exchange chromatography, affinity chromatography, and sedi- .; 
20 mentation according to density. 

The temn "purified viral polypeptide" refers to an HCV polypeptide or fragment thereof which is 
essentially free, i.e.. contains less than about 50%. preferably less than about 70%, and even more 
preferably less than about 90%, of cellular components with which the viral polypeptide is naturally 
associated. Techniques for purifying viral polypeptides are known in the art, and examples of these 
25 techniques are discussed infra. The temn "purified viral polynucleotide" refers to an HCV genome or ^ 
fragment thereof which is essentially free, i.e.. contains less than about 20%. preferably less than about 
50%. and even more preferably less than about 70% of polypeptides with which the viral polynucleotide is 
naturally associated. Techniques for purifying viral polynucleotides from viral particles are known in the art . 
and include for example, disruption of the particle with a chaotropic agent, and separation of the 
30 polynucleotide{s) and polypeptides by Ion-exchange chromatography, affinity chromatography, and sedi- 
mentation according to density. 

"Recombinant host cells", "host cells", "cells", "cell lines", "cell cultures", and other such terms 
denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities refer to cells which 
can be, or have been, used as recipients for recombinant vector or other transfer DNA. and include the 
35 progeny of the original cell which has been transfecte^J. It is understood that the progeny of a single > :;t5drr'?- 

parental cell may not necessarily be completely identical in morphology or in genomic or. tctoi DNA 
complement as the original parent, due to natural, accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.g.. a pisismid, a chromosome, a virus, a cosfmidi etc. that 
behaves as an autonomous unit of polynucleotide replication within a celj; i.e.. capable of replication under 
40 its own control. 

A "vector" is a replicon in which another polynucleotide segment is attached, so as to bring alwut the 
replication and/or expression of the attached segment. 

"Control sequence" refers to polynucleotide sequences which are necessary to effect the expression of 
coding sequences to which they are ligated. The nature of such control sequences differs depending upon 
45 the host organism; in prokaryotes, such control sequences generally include promoter, ribosomal binc&ng 
site, and terminators; in eukaryotes, generally, such control sequences include promoters, terminators and, 
in some instances, enhancers. The temn "control sequences" is intended to include, at a minimum, all 
components whose preserKie is necessary for expression, and may also Include additional components 
whose presence is advantageous, for exampleT leader sequences." 
50 "Operably linked" refers to a juxtaposition wherein the components so described are in a relationship 
permitting them to function in their Intended manner. A control sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under oonditions 
compatible with the control sequences. , 

An "open reading frame" (ORF) is a region of a, polynucleotide sequence which encodes a polypeptide; - 
55 this region may represent a portion of a coding ^dqueirice or a total coding sequence, • : . ^ 

A "coding sequence* is a PQ^^"^ f ^||^^Sg 9^ ^ transcribed irito m RNA and/or translfaffBd 

into a polypeptide when placed undeV^^ fegulatdry sequences, the boundaries of - 

the coding sequence are deternriined by^^j^ifeoi^^^ the s'-terrhinus and a translation stop ; - 

';•> . .r •'^;i*>'Vv>VV>Va-;* '■ * . - 
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codon at the 3'-terminus. A coding sequence can include* but is not fimited to mRNA. cDNA, and 
recombinant polynucleotide sequences. 

"Immunologically Identifiable with/as" refers to the presence of epitope(s) and polypeptides{s) which are 
also present in the designated potypeptide(s). usually HCV proteins. Immunological identity may be 
5 determined by antibody binding and/or competition in binding; these techniques are known to those of 
average skill in the art, and are also illustrated infra. 

As used herein, "epitope" refers to an antigenic determinant of a polypeptide; an epitope could 
comprise 3 amino acids in a spatial conformation which is unique to the epitope, generally an^ epitope 
consists of at least 5 such amino acids, and more usually, consists of at least 8-10 such amino acids. 
70 Methods of detenmining the spatial conformation of amino acids are known in the art, and include, for 
example, x-ray crystallography and 2-dimensional nuclear magnetic resonance. 

A polypeptide is "immunologically reactive" with an antibody when it binds to an antibody due to 
antibody recognition of a specific epitope contained within the polypeptide. Immunological reactivity may be 
determined by antibody binding, more particularly by tiie kinetics of antibody binding, and/or by competition 
15 in binding using as competitor(s). a known polypeptide(s) containing an epitope against which tiie antibody 
is directed. The techniques for determining whetiier a polypeptide is immunologically reactive witii an 
' antibody are known in the art. 

As used herein, the term "immunogenic polypeptide" is a polypeptide tiiat elicits a cellular arid/or ^ 
humoral re^nse, whether alone or linked to a carrier in the presence or absence of an adjuvant 
20 The term "polypeptide" refers to a polymer of amino acids and does not refer to a specific lengtti of the 
product; thus, peptides, oligopeptides, and proteins are included within tiie definition of polypeptide. This 
term also does not refer to or exclude post-expression modifications of the polypeptide, for example, 
glycosylations. acetylations, phosphorylations and tiie like. Included within tiie definition are. for example, 
polypeptides containing one or more analogs of an amino acid (including, for example, unnatural amino 
25 acids, etc.), polypeptides witii substituted linkages, as well as other modiflcations known in the art both 
naturally occunring and non-naturaliy occun-ing. 

'Transfonmation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, 
irrespective of the method used for the insertion, for example, direct uptake, transduction, f-mating or 
electroporation. The exogenous polynucleotide may be maintained as a non-integrated vector, for example. 
30 a plasmid, or alternatively, may be integrated into the host genome. 
'Treatment" as used herein refers to prophylaxis and/or therapy. 

An "individual", as used herein, refers to vertebrates, particularly members of the mammalian species, 
and includes but is not limited to domestic animals, sports animals, and primates, including humans. 

As used herein, the "sense strand" of a nucleic acid contains the sequence that has sequence 
35 homology to that of mR^4A. The "anti-sense strand" contains a sequence which is complementary to tiiat of 
the "sense strand". 

As used herein, a "positive stranded genome" of a virus is one in which ttie genwne. whetiier RNA or 
DNA, is single-stranded and which encodes a viral polypeptide(s). Examples of positive stranded RNA 
viruses include Togaviridae, Coronaviridae, Retrovirldae, Picornavlrldae, and Calicivirldae. Included also, are 
40 the Flaviwridae. which were formerly classified as Togaviradae. See Relds & KnIpe (1986). 

As used herein, "antibody-containing body component" refers to a component of an individual's body 
which is a source of the antibodies of interest Antibody containing body components are known in the art, 
and include but are not limited to, for example, plasma, serum, spinal fluid, lymph fluid, ttie external 
sections of the respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, white bkwd cells, and 
45 myelomas. 

As used herein, "purified HCV" refers to a preparation of HCV which has been isolated tram the cellular 
constituents witii which ,the.jvims is normally associated, and from other types of viruses which may be 
present in the infected tissue. The techniques for Isolating viruses are known to those of skill in tiie art, and 
include, for example, centrifugation and affinity chromatography; a metiiod of preparing purified HCV is 
so discussed infra. 

The tenm "HCV particles"^'W used hereliT include entire "virion as well as particles which are 
intermedlcttes in virion fonmation. HCV particles generally have one or more HCV proteins associated witii 
tiie HCV nucleic acid. 

As used herein, the term; "probe" refers to a polynucleotide which forms a hybrid structure with a 
55 sequence in a target region, due to coniplementarity of at least one sequence in tiie probe with a sequence 
in the tficget^i^igru^M^^^ 
to prime fib pd^ynnerase^ ; ■ 

As used hereinivthe/te^^ to a region of tfie nucleic add which is to be ampJified 

■ /■ . v.i:/v;v^"'':5fe /■■:,'■'.'. -11 • ■^ ■ • : ^ ' ' 
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and/or detected; 

As used herein, the term "viral RNA", which includes HCV RNA. refers to RNA from the viral genome, 
fragments thereof, transcripts thereof, and mutant sequences derived therefrom. 

As used herein, a 'biological sample" refers to a sample of tissue or fluid isolated from an individual, 
5 including but not limited to. for example, plasma, serum, spinal fluid, lymph fluid, the external sections of 
the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs, and 
also samples of in ^o cell culture constituents (including but not limited to conditioned medium resulting 
from the growth of cells in cell culture medium, putatively viraily infected cells, recombinant cells, and cell 
components). 



H. Description of the Invention 

The practice of the present invention will employ, unless othenwise indicated, conventional techniques 
15 of moiecular biology, microbiology, recombinant DNA. and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. See 'e.g,. Maniatis. Fitsch & Sambrdpk^. MOLECULAR 
CLONING; A LABORATORY MANUAL (1982); DNA CLONING. VOLUMES I AND II (D.N GloveV ed. 1985); 
OLIGONUCLEOTIDE SYNTHESIS (MJ. Gait ed. 1984); NUCLEIC ACID HYBRIDIZATION (a D. Hames & 
S.J. Higgins eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & SJ. Higglns eds. 1984); 
20 ANIMAL CELL CULTURE (RJ. Freshney ed. 1986): IMMOBILIZED CELLS AND ENZYMES^ (IRL Press, 
1986); 8. Perbal. A PRACTICAL GUIDE TO MOLECULAR CLONING (1984); the series, METHODS IN 
ENZYMOLOGY (Academic Press, Inc.); GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. 
Miller and M.P. Calos eds. 1987. Cold Spring Harbor Laboratory). Methods in Enzymology VoL 154 and Vol. 
155 QNu and Grossman, and Wu, eds,, respectively). Mayer and Walicer. eds. (1987), IMMUNOCHEMICAL 
25 METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press. London). Scopes. (1987). PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE. Second Edition (Springer-Veriag. N.Y.), and HANDBOOK 
OF EXPERIMENTAL IMMUNOLOGY. VOLUMES l-IV (D.M. Weir and C. C. Blackwell eds 1986). All patents, 
patent applications, and publications mentioned herein, both supra and infra, are hereby incorporated herein 
by refanence. 

30 The useful materials and processes of the present invention are made possible by the provision of a 
family of nucleotide sequences isolated from cDNA libraries which contain HCV cDNA sequences. These 
cDNA fibrar ies were derived frijm nucleic acid sequences present in the plasma of an HCV-infected 
chimpanzee. The construction of one of these libraries, the "c" library (ATCC No. 40394). was reported in 
EPO Pub, No. 318,216. Several of the clones containing HCV cDNA reported herein were obtained from the 

35 "c" lltwary. Although other clones reported herein were obtained from other HCV cDNA libraries, the 
presence of clones containing the sequences in the "c" library was confirmed. As discussed in EPO Pub. 
No. 318.216. the family of HCV cDNA sequences isolated from the "c" library are not of human or 
chimpanzee origin, and show no significant homology to sequences contained within the HBV genome. 
The availability of the HCV cDNAs described herein permits the construction of polynucleotide probes 

40 which are reagents useful for detecting viral polynucleotides in biological samples, including donated blood. 
For example, from the sequences it is possible to synthesize DNA oligomers of about 8-10 nucleotides, or 
larger, which are useful as hybridization probes to detect the presence of HCV RNA in, for example, 
donated blood, sera of subjects suspected of harboring the vims, or cell culture systems in which the virus 
is replicating. In addition, the cDNA sequences also allow the design and production of HCV specific 

4S polypeptides which are useful as diagnostic reagents for tiie presence of antibodies raised during HCV 
infectioa Antibodies to purified polypeptides derived from the cDNAs may also be used to detect viral - 
antigens in biological samples, including, for example, donated blood samples, sera from patients with 
NANBH, and in. tissue culture systems being used for HCV replication. Moreover, the immunogenic 
polypeptides disclosed herein, which are encoded in portions of ttie ORF of HCV cDNA shown in Fig. 17. 

so are atep useful for HCV screening, diagnosis, and tiiBatment. and for raising antibodies which are also useful 
for th^ purposes. 

In addition, the novel cDNA sequences described herein enable further characterization of tiie HCV 
genome. Polynucleotide probes and primers derived from tfiese sequences may be used to amplify 
sequences present in cDNA libraries, and/or to screen cDNA libraries for additional overlapping cDNA 
55 ^ sequenceis, Aftrtiich, in turn, may be used to obtain more overiapping sequences. As indicated infra, and in - 

the genome of HCV appears to be RNA conriprised primarily of a large ppen:. 
;.'^^^^rW^pS5 3 '3^® polypratein. ; / ' 

\^ provided herein, the polypeptides derived from these sequences^ 'and ttie 
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' "immuhc^erl^ polypeptides described lierein.ts welPag antibodies directed against these polypeptides are 
also useful in the isolation and identification <f the blood-borne NABV (BB-NANBV) agent(s). For example, 
antibodies directed against HCV epitopes cortained in polypeptides derived froxn the cDNAs may be used 
in processes based upon affinity chromatogr^hy to isolate the virus. Afternatively. the antibodies may be 

5 used to identify viral particles isolated by oter techniques. The viral antigens and the genomic material 
within the isolated viral particles may then be iirther characterized. 

In addition to the above, the information pDvided infra allows the identification of additional HCV strains 
or isolates. The isolation and characterization if the additional HCV strsuns or isolates may be accomplished 
by isolating the nucleic acids from body compnents which contain viral particles and/or viral RNA. creating 

10 cDNA libraries using polynucleotide probes bced on the HCV cDNA probes described infra., screening the 
libraries for clones containing HCV cDNA seqences described infra., and comparing the HCV cDNAs from 
the new isolates with the cDNAs described inia. The polypeptides encoded therein, or in the viraLgenome, 
may be monitored for immunological cross-Bactivity utilizing tlie polypeptides and antibodies described 
supra. Strains or isolates which fit within theparameters of HCV. as described in the Definitions section. 

75 supra., are readily identifiable. Other methodsfor identifying HCV strains will be obvious to those of skill in 
the art, based upon the information provided hrein. : • 



isolation of the HCV cDNA Sequences 

20 

The novel HCV cDNA sequences descbed infra, extend the sequence of the cDNA to the HCV 
genome reported in EPO Pub. No. 318.216. He sequences which are fMBsent in clones b114a, 18g, ag30a. 
CA205a. CA290a, CA216a, pi 14a, CA167b. CA156e, CA84a. and CA59a lie upstream of the reported 
sequence, and when compiled, yield nucleotites nos. -319 to 1348 of the composite HCV cDNA sequence. 

25 (The negative number on a nucleotide indictes its distance upstream of the nucleotide which starts the 
putative initiator MET codon.) The sequenceswhich are present in clones t>5a and lejh lie downstream of 
the reported sequence, and yield nucleotides los. 8659 to 8866 of the composite sequence. The composite 
HCV cDNA sequence which includes the seqtfinces in the aforementioned clones, is shown in Rg. 17^ 
The novel HCV cDNAs described herein lere isolated from a number of HCV cDNA libraries, including 

30 the "c" library present In lambda gtl 1 (ATCCNo. 40394). The HCV cDNA libraries were constructed using 
pooled serum from a chimpanzee with chronit HCV infection and containmg a high titer of the virus, i.e.. at 
least 10^ chimp infectious doses/ml {CID/mI).The pooled serum was used to isolate viral particles; nucleic 
acids isolated from these particles was usedas the template in the construe tion of cDNA libraries to the 
viral genome. The procedures for Isolation of futative HCV particles and for constructing the "c" HCV cDNA 

35 library is described in EPO Pub. No. 318.216. Other methods for constructing HCV cDNA libraries are 
known in the art. and some of these metfuds are described infra., in the Examples. Isolation of the 
sequences was by screening the libraries u^g synthetic polynucleotide probes, the sequences of which 
were derived from the s'-region and the s'-ngion of the known HCV cDNA sequence. The description of 
the method to retrive the cDNa sequences i mostly of historical Interest. The resultant sequences (and 

40 their complements) are provided herein, an( the sequences, or any portion thereof, could be prepared 
using synthetic methods, or by a combinationof synthetic methods with retrieval of partial sequences using 
methods similar to those described herein. 



45 Preparation of Viral Polypeptides and Fragmeits 



The availability of HCV cDNA sequenes, or nucleotide sequences derived therefrom (including 
segments and modifications of the sequeno), permits the construction of expression vectors encoding 
' antigenicaliy active regions of the polypeptidi encoded in either strand. These antigenically active regions 
50 may be derived from coat or envelope antipns or from core antigens, or from antigens which are norh 
structural including, for example, poiynucleotle binding proteins, polynucleotide polymerase(s), and other 
viral proteins required for the replication antor assembly of the vims particle. Fragments encoding the 
desired polypeptides are derived from the cDNA dones using conventional restriction digestion or by^ 
synthetic methods, and are ligated into vctors which may, for example, contain portions of fusion;. 
55 sequences such as beta-galactosidase or siperoxide dismutase (SOD), preferably SOD. Methckis and ',^ 
^^y^tors ;,vvhich^ a^^ useful for the production if polypeptides which contain fusion sequence^ pf<S0p^ 
'f3escri6ed in European Patent Office Publicatan number 0196056, pubSshed October 1, l986; y€lc:tof^ 
' the expression of fusion polypeptides of SOD and HCV polypeptides encoded In a number of HGV ciones|^^^ 

^ ■ ■ ' ■ 13 " -. ■ 
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are described infra., in the Examples. Any desired portion of the HCV cDNA containing an open reading 
frame, in either sense straKl. can be obtained as a recombinant polypeptide, such as a mature or fusion 
protein; alternatively, a polypeptide encoded in the cDNA can be provided -by chemical synthesis. 

The DNA encoding He desired polypeptide, whether In fused or mature form, and whether or not 
5 containing a signal sequeice to permit secretion, may be ligated into expression vectors suitable any 
convenient host Both eularyotic and prokaryotic host systems are presently used in forming recombinant 
polypeptides, and a sumnary of some of the more common control systems and host cell lines is given 
infra. The polypeptide is ttan isolated from lysed cells or from the culture medium and purified to the extent 
needed for its intended use. Purification may be by techniques known in the art. for example, differential 
70 extraction, salt fractionatiai. chromatography on ion exchange resins, affinity chromatography, centrifuga- 
tion,.^nd the like. See. forexample, Methods in Enzymology for a variety of methods for purifyinp. proteins. 
Such polypeptides can beused as diagnostics, or those which give rise to neutralizing ahtiBodies may be 
fomnuiated into vaccines- lintibodles raised against these polypeptides can also be used as diagnostics, or 
for passive immunothera^. In addition, as discussed infra., antibodies to these polypeptides are useful for 
. 75 isolating and identifying HCV particles. , 



Preparation of Antigenic RIypeptides and Conjugation with Carrier 

20 An antigenic region ola polypeptide is generally relatively small-typically 8 to 10 amino acids or less 
in length. Fragments of a few as 5 amino acids may characterize an aitigenic region. These segments 
may correspond to regiois of HCV antigen. Accordingly, using the cDNAs of HCV^as a basis, DNAs 
encoding short segments ef HCV polypeptides can be expressed recombinantty either as fusion proteins, or 
as isolated polypeptides, h addition, short amino add sequences can be conveniently obtained by chemical 
25 synthesis. In instances wterein the synthesized polypeptide is correctly configured so as to provide the 
correct epitope, but is toosmall to be immunogenic, the polypeptide may be linked to a suitable carrier. 

A number of techniqies for obtaining such linkage are known in the art. including the formation of 
disulfide linkages using N-succinimldyI-3-(2-pyridyIthio)proplonate (SPDP) and succinimidyl 4-{N-mal- 
eimidomethyl)cyclohexane-1-carboxylate (SMCC) cA»tained from Pierce Company, Rockfdrd, Illinois, (if the 
30 peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue.) These reagents 
create a disulfide linkage between themselves and peptide cysteine residues on one protein and air amide 
linkage through the epsiln-amino on a lysine, or other free amino group in the other. A variety of such 
disulfide/amide-forming ^nts are known. See, for example, Immun. Rev, (1982) 62:185. Otiier bifunctional 
coupling agents form a trtwtiier rather tiian a disulfide linkage. Many of fliese thio-ether-fonming agents are 
35 commercially available ad include reactive esters of 6-maleimldocaproic acid. 2-bromoacetic acid, 2- 
iodoacetic acid. 4-(N-malamidometiiyl)cyclohexane-1-carboxylic acid, and the like. The carboxyl groups can 
be activated by combirtig tiiem witti succinimide or 1-hydroxyl-2-rtitro-4-sulfonic acid, sodium salt. 
Additional methods of copling antigens employs the lotavirus/^bindjng peptide" system described in EPO- 
Pub. No. 259.149, tiie dBclosure of which is Incorporated herein by reference. The foregoing list is not 
40 meant to be exhaustive, ad modifications of tiie named compounds can clearly be used. 

Any carrier may be ued which does not itself Induce the production of antibodies harmful to the host. 
Suitable carriers are typiaJly large, slowly metabolized macromolecules such as proteins; polysaccharides, 
such as latex functionatiad sepharose. agarose, cellulose, cellulose beads and the like: polymeric amino 
acids, such as polyglutanic acid, polylysine, and the like; amino ac^ copolymers; and inactive virus 
45 particles. Especially usdul protein substrates are serum albumins. Ikeyhole limpet hembcyanin. im- 
munoglobulin molecules, thyroglobuiln, ovalbumin, t^anus toxoid, and ©ther proteins well know<ii»to tiiose 
skilled in the art. 

^ ^ In addition to fulHenjlh viral proteins, polypeptides comprising truncated HCV amjno acid sequences 

encoding at least one vir< epitope are useful immunological reagents. Rsr example. poiypepti^§l"S3^m|)ris- >t 
50 ing such truncated sequences can be used as reagents in an immunoassay. These polypeptides also are 
candidate subuntt antigeis in compositions for antiserum production m vaccines. While these truncated ^ 
sequences can be produed by various known treatments of native viraL» protein, it Is generally prefeaed to 
make synthetic or recomWnant polypeptides compris^g an HCV sequemce. Polypeptides comprising these * 
ti'uncated HCV sequence can be made up entirely of HCV sequentnes (one or more epitopes. V^tth^ 
55 contiguous or noncontigiDus), or HCV sequences and heterologous secpences In a fusiori^prbtein.:^^ 
s^^;^"; - heterologous sequences ixslude sequences that provide for secretion froim a recombinant bd^^^ 

* ' immunological reactivity of the HCV epltope(s), or fecilitate the omupllng ' pf^. f he^^^^ 

; ■ ^ ' ' immunoassay support ora vaccine carrier. See. e.g.. EPO Pub. No. 1161201; U.S. 
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Pub. No. 259.149; U.S. Pat. No. 4,629,783. the disclosures of which are incorporated herein by refere?rc0. 

Thi size of polypeptides comprising the truncated HCV sequences can vary wideiy. the minimum size 
being a sequence of sufficient size to provide an HCV epitope, while the maximum size is not critical. For 
converfence, the maximum size usually is not substantially greater than that required to provide the desired 
5 HCV eiiitopes and funct!on(s) of the heterologous sequence, if any. Typically, the taincated HCV amino acid 
sequen» will range from about 5 to about 100 amino acids in length. More typically, however, the HCV 
sequence will t>e a maximum of about 50 amino acids in length, preferably a maximum of about 30 amino 
acids. I is usually desirable to select HCV sequences of at least about 10. 12 or 15 amino adds, up to a 
maximim of about 20 or 25 amino acids. 

10 Tnncated HCV amino add sequences comprising epitopes can be identified in a number of ways. For 
exampb, the entire viral pro&ein sequence can be screened by preparing a series of short peptides that 
together span the entire protein sequence. An example of antigenic screening of the regions of the HCV 
polypr<tein is shown infra. In sddition, by starting with, for example, lOOmer polypeptides, it would be 
routineto test each polypeptide for the presence of epitope(s) showing a desired reactivity, and then testing 

75 progreisively smaller and overlapping fragments from an identified lOOmer to map the epitope of interest. 
Screering such peptides in an inrimunoassay is within the skill of the art It is also known to carry out a 
compulsr analysts of a protein sequence to identify potential epitopes, and then prepare oligopeptides 
compriang the identified regions for screening. Such a conjputer analysis of tiie HCV amino acid sequence 
is shovn in Rg. 20. where the hydrophilic/hydrophobic character is displayed above the antigen index. The 

20 amino Bdds are numbered from the starting MET (position 1) as shown in Rg. 17. It is appreciated by tiiose 
of skill in tiie art that such computer analysis of antigenicity does not always identify an epitope tiiat 
actual Ij exists, and can also mcorrectiy identify a region of the protein as containing an epitope. Examples 
of HCVamino add sequences that may be useful, which are expressed from expression vectors comprised 
of clones 5-M. 81, CA74a. 35f. 279a. C36, C33b. CA290a, C8f, C12f. 14c. 15e. C25c, C33c. C33f. 33g, 

25 C39c. C40b, CA167b are described infra. Otiier examples of HCV amino acid sequences that may be useful 
as des:ribed herein are set forth below. It is to be understood that tiiese peptides do not necessarily 
precisdy map one epitope, and may also contain HCV sequence tiiat is not immunogenic. These' non- 
immunogenic portions of the sequence can be defined as described above using conventional techniques 
and ddeted from the described sequences. Further, additional truncated HCV amino acid sequences tiiat 

30 comprte an epitope or are immunogenic can be identi'fied as described above. The following sequences 
are gi\en by amino add numt)er (i.e., "AAn") where n Is the amino acid number as shown in Fig. 17: 
AA1-AA25; AA1-AA50; AA1^AA84; AA9-M177: AA1-AA10; AA5-AA20; AA20-AA25; AA35-AA45; AA50" 
AA10Q AA40-M90: AA45-AA65: AA65-AA75; AA80-90; AA99-AA120; AA95-AA110; AAl65-AA120; AAIOO- 
AA15QAA150-AA200; AAt55^170; AA190-AA210; AA200-AA250; AA220-AA240; AA245-AA265; AA250- 

35 AA30Q AA290-AA330; AA29(>-305: AA300-AA350: AA310-AA330; AA350-AA400; AA380-AA395; AA405- 
AA495;AA400-AA450; AA405-AA415: AA415-AA425; M425-AA435; AA437-AA582: AA450-AA500; AA440- 
AA46QAA460-AA470; AA4^AA495: AA500-AA550: AA511-AA69a; AA515-AA550; AA550-AA600: AA550- 
AA62SM575-AA605; AA585-AA600: AA600-AA650; AA600-AA625: AA635-AA665; AA650-AA700; AA645- 
AA68QAA700-AA750: AA70D-AA725: AA700-AA750: AA725-AA775; AA770-AA790; AA750-AA800; AA80G- 

40 AA815 AA825-AA850: AA850-AA875; AAB00-AA85O: AA920-M990; AA850'AA900: AA920-AA945; AA940- 
AA965 AA970-AA990, A^O-AAIOOO; AA1000-AA1060; AA1000-AA1025; AA1 000-AA1 050; AAi025- 
AA104t AA1040-AA1055: AA1075-AA1175; AA1050-AA1200: M1070-AA1100: AA1100-AA1130; AA1140- 
AAlieS; AA1192-AA1457; AA1195-AA1250; AA1200-AA1225: AA1225-AA1250: AA1250-AA1300; AA1260- 
AA131B: AA1260-AA12a0; AA1266-AA1428; AA1300-AA1350: M1290-AA1310; AA1310-AA1340; AA1345- 

45 AA1405: AA1345-AA1365; AA135Q-AA1400; M1365-AA1380; AA1380-AA1405: AA1400-AA1450: AA1450- 
AAISOO; AA1460-AA1475; AA1475-AA1516; AA1475-AA1500; AA1500-AA1550: AA150O-AA1515; AA1515- 
AA15SD; AA1550-AA1600; AA1545-AA1560; AA1569-AA1931; AA1570-AA1590: AA1595-AA1610; AA1590- 
AA16SD: AA1610-AA1645; AA1650-AA1690; AA1685-AA1770: AA1689-AA1805; AA1690-AA1720; AA1694-^ 
AA1735: AA1720-AA1745; AA1745-AA1770; AA1750-AA1800: M1775-M1810rM1795-AA1850; AAI^^^^ 

50 AAigOD; AA190O-AA1950;"AA190O-AA1920; AA1916-AA2021; Mt920-M1940;.Ml949-AA2124; A^^^ 
AA20m; AA1950-AA1985; AA1980-AA2000; AA2000-AA2050; AA2005-AA2025: M2020^AA2045; AAi?045- 
AA2ia): AA2045-AA2070; AA2054-AA2223; AA2070-AA2100; AA2100-AA2t50; AA2150-AA2200: AA2200- 
AA229): AA2200-AA2325; AA2250-AA2330; AA2255-AA2270; AA2265-AA22e0: AA2280-AA2296; AA2287- 
AA23a5; AA2300-AA2350; AA229D-AA2310; AA2310-AA2330; AA2330-AA2350: AA2350-AA2400: AA2348-: 

55 AA24a: AA2345-AA2415; AA2345-AA2375; AA2370-AA2410: AA2371-AA2502; AA240b-M2456; )^ 
AA2425; AA2415-AA2450; AA2445-AA2500; AA2445-AA2475; AA24Z07AA?490^ AA250C>rAi\2550; AA2i505- 
AA254): AA2535-AA2560; AA2550-AA2600; AA25eO-AA2586;^^^i^^ 

AA26a); AA2640-AA2660; AA2650-AA2700; AA2655-AA2670: M2^ AA2740- 
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AA2760; AA2750-AA2800: AA2755-AA2780; AA2780-AA2830; AA2785-AA2810; AA2796-AA2886: AA2810- 
AA2825; AA2800-AA2850; AA2850-AA2900: AA2850-AA2865; AA2885-AA2905; AA2900-AA2950; AA2910- 
AA2930; AA2925-AA2950: AA2945-end(C' terminal). 

The above HCV amino acid sequences can be prepared as discrete peptides or Incorporated into a 

5 larger polypeptide, and may find use as described herein. Additional polypeptides comprising truncated 
HCV sequences are described in the examples. 

The observed relationship of the putative polyproteins of HCV and the Flaviviruses allows some 
prediction of the putative domains of the HCV "non-structural" (NS) proteins. The locations of the individual 
NS proteins in the putative Flavivirus precursor polyprotein are fairly well-known. Moreover, these also 

10 coincide with observed gross fluctuations in the hydrophobicity profile of the polyprotein. It Is established 
that NS5 of Flaviviruses encodes the virion polymerase, and that NS1 corresponds with a complement 
fixation antigen which has been shown to be an effective vaccine in animals. Recently, it has been shown 
that a flaviviral protease function resides in NS3. Due to the observed similarities betfl^en HCV and the 
Flaviviruses. described infra., deductions concerning the approximate locations of the corresponding protein 

ts domains and functions in the HCV polyprotein are possible. The expression of polypeptides containing 
these domains in a variety of recombinant host cells, including, for example, bacteria, yeast, insect, ap^. 
vertebrate cells, should give rise to important immunological reagents which can be used for diagriosis, 
detection, and vaccines. 

Although the non-structural protein regions of the putative polyproteins of the HCV isolate described 
20 herein and of Flaviviruses appear to have some similarity, there is less similarity between the putative 
structural regions which are towards the N-terminus. In this region, there is a greater divergence in 
sequence, and in addi tion, the hydrophobic profile of the two regions show less similarity. This 
"divergence" begins in the N-terminal region of the putative NS1 domain in HCV, and extends to the 
presumed N-terminus. Nevertheless, it may still be possible to predict the approximate locations of the 
25 putative nucleocapsid (N-terminal basic domain) and E (generally hydrophobic) domains within the HCV 
polyprotein. In the Examples the predictions are based on the changes observed in the hydrophpbic profile 
of the HCV polyprotein, and on a knowledge of the location and character of the flaviviral proteins. From 
these predictions it may be possible to identify approximate regions of the HCV polyprotein that could 
correspond with useful immunological reagents. For example, the E and NS1 proteins of Flaviviruses are 
30 known to have efficacy as protective vaccines. These regions, as well as some which are shown to be 
antigenic in the HCV isolate described herein, for example those within putative NS3, C, and NS5. etc., 
should also provide diagnostic reagents. Moreover, the location and expression of viral-encoded enzymes 
may also allow the evaluation of anti-viral enzyme inhibitors, i.e., for example, inhibitors which prevent 
enzyme activity by virtue of an interaction with the enzyme itself, or substances which may prevent 
• 35 expression of the enzyme, (for example, anti-sense RNA. or other drugs which interfere with expression). 



Preparation of Hybrid Particle Immunogens Containing HCV Epitopes 

40 The immunogenicity of the epitopes of HCV may also be enhanced by preparing them in mammalian or 
yeast systems fused with or assembled with particle-forming proteins such as, for example, that associated 
with hepatitis B surface antigen. Constructs wherein the NANBV epitope is linked directly to the particle- 
forming protein coding sequences produce hybrids which are immunogenic with respect to the HCV 
epitope. In addition, all of the vectors prepared include epitopes specific to HBV. having various degrees of 

45 immunogenicity. such as. for example, the pre-S peptide. Thus, particles constructed from particle fonning. 
protein which include HCV sequences are immunogenic with respect to HCV and HBV. 

Hepatitis surface antigen (HBSAg) has been shown to be formed and assembled into particles in S. " 
cerevisiae (Valenzueia et al. (1982)). as well as in,. for*#xampIe(^ammalian cells (Valenzuela, P.. et al. 
(1984)). The formation of such particles has been shown to enhance the imhnu1nbgenicify*16f the monomer 

50 subunit The constructs may also include the immunoddnriinant ispltope of HBSAg, comprising the 55 amino 
acids of the presurface (pre-S) region. Neurath et al. (1984). Constructs of the pre-S-HBSAg particle 
expressible in yeast are disclosed in EPO 174,444, published March t9, 1986; hybrids including heterolo- 
gous viral sequences for yeast expression are disclosed in EPO 175,261, published March 26, 1966. These 
constructs may also be expressed in mammalian cells such as Chinese hamster ovary (CHO) cells using an 

55 SV40-dihydrofblate reductase vector (Michelle et al. (1984))'. 

In addition, portions ^r^"^ Jf^^^ replaced with codons 

encoding an HCV epitope. In this replacenrierijl^^^on^^ the aggregation of ; 

the units to form immunogenic particles inf y§a)5t|^rf^^^^^ thus eliminating additional ; 
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HBV antigenic sites from competition witii the HCV epitope. • "^-y^ 



Preparation of Vaccines 

Vaccines may be prepared from one or more immunogenic polypeptides derived from HCV cDNA. 
including the cONA sequences described in the Examples. The observed homology between HCV and 
Flaviviruses provides information concerning the polypeptides which may be most effiective as vaccines, as 
well as the regions of the genome in which they are encoded. The general structure of the Ravivirus 

10 genome is discussed in Rice et al (1986). The flavlvirus genomic RMA is believed to be the only virus- 
specific mRNA species, and it is translated into the three viral structural proteins, i.e., C. M, and E. as well 
as two large nonstructural proteins, NS4 and NS5, and a complex set of smaller nonstructural proteins. It Is 
known that major neutralizing epitopes for Flaviviruses reside in the E (envelope) protein (Roehrig (1986)). 
Thus, vaccines may be comprised of recombinant polypeptides containing epitopes of HCV E, These 

75 polypeptides may be expressed In bacteria, yeast, or mammalian ceils, or alternatively may be isolated 
from viral preparations. It is also anticipated that the other sfructural proteins may also ccHttaln epitopes 
which give rise to protective anti-HCV antibodies. Thus, polypeptides containing the epitopes of C. and Mr 
may also be used, whether singly or In combination, in HCV vaccines. 

In addition to the above, it has been shown that immunization with NS1 (nonstmcturai protein 1), results 

20 in protection against yellow fever (Schlesinger et al (1986)). This is true even though the immunization does 
not give rise to neutralizing antibodies. Thus, particularly since this protein appears to be hi^ty conserved 
among Raviviruses. it is likely that HCV NS1 will also be protective against HCV infectian. Moreover, it also 
shows that nonstructural proteins may provide protection against viral pathogenicity, even if they do not 
cause the production of neutralizing antibodies. 

25 The information provided in the Examples conceming the immunogenicity of the polypeptides ex- 
pressed from cloned HCV cDNAs which span the various regions of the HCV ORF also alltgis predictions 
concerning their use in vaccines. • 

In view of the above, multivalent vaccines against HCV may be comprised of one or mcMi? epitopes 
from one or more sfructural proteins, and/or one or more epitopes from one or more nonstrucbiral proteins. 

30 These vaccines may be comprised of. for example, recombinant HCV polypeptides and/or polypeptides 
isolated fram the virions, in particular, vaccines are contemplated comprising one or more of the foltowing 
HCV proteins, or subunit antigens derived therefrom: E, NS1 , C. NS2. NS3. NS4 and NS5. Particularly 
preferred are vaccines comprising E and/or NS1 . or subunits thereof. 

The preparation of vaccines which contain an immunogenic polypeptide(s) as active ingredients, is 

3S known to one skilled in the art. Typically, such vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in. or suspension In. liquid prior to Inioction may 
also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The 
active immunogenic ingredients are often mixed with excipients which are pharmaceutically acceptable and 
compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol. 

40 ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts 
of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and/or adOuvants which 
enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide. N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor- 
muramyl-L-alanyl-D-isoglutamine (CQP 11637. refenred to as nor-MDP). N-acetylmuramyl-L-alanyl-D- 

45 isoglutaminyl-L-alanine-2-(1 '-2'-dipalmitoyi-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 1983.5A. 
referred to as MTP-PE). and RIBI. which contains three components extracted from bacteria, mon- 
ophosphoryl lipid A. frehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% 
squalene/Tween 80 emulsion. Tt|e^,effectiveness of an adjuvant may be determined by measuring the 
amount of antibodies directed agatrist an' irhmunogenic polypeptide containing an HCV antig^c sequence 

50 resulting from administration of this polypeptide in vaccines which are also comprised of the varibiis . 
adjuvants. 

The vaccines are conventionally administered parenterally. by injection, for exampte, either sub- " 
cutaneously or intramuscularly. Additional formulations which are suitable for other modes of adminisfration 
include suppositories and. in some cases, oral formulations. For suppositories, fradltidn^ biriders and 
55 carriers may include, for example, poly alky lene glycols or triglycerides; such suppositories may be formed . 
from m jxt,ures.^,Qontgujrnn^^ in the range of 0^5% to 10%. preferably 1%-2%. Oral 

formulations' in^pide for example, phanmaceutical grades .of 

mannitoi. lactose, starch*;rTiagnW).uifji sacch^ne. cellulose, magnesium cartxmate. and the , 



EP0 388 232A1 



like. These oompo^ttons take the farm of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10%-95% of active ingredient, preferably Z5%-70%. 

The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable 
salts include the add addtion salts (farmed with free amino groups of the peptide) and which are formed 
5 with inorganic adds such as. for example, hydrochloric or phosphoric adds, or such organic acids such as 
acetic. oxaBc. tartaric, maleic. and the like. Salts formed with the free carboxyl groups may also be derived 
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, 
and sudi organic bases as isopropytamine, trimethylamine. 2-ethylamino ethanol. histidine. procaine, and 
the tike. 



Dosage and Administration of V«x:ines 

The vacdnes are administered in a manner compatible with the dosage formulation, and in such 
IS amount zs will be prophyiactically and/or therapeutically effective. The quantity to be administered, which is 
generally in flie range of 5 micrograms to 250 micrograms of antigen per dose, depends on the subjeqt to, 
be treated, c^^adty of the subject's immune system to synthesiEe antibodies, and thejdegree of protection ' 
desired. Predse amounts of actwe ingredient required to be administered may depend on the judgmerit of 
the pradBion^ and may be peculiar to each subject 
20 The vaodne may be given rn a single dose schedule, or preferably In a multiple dose schedule. A 
multiple dose schecfole Is one in which a primary course of vaccination may be with 1-10 separate doses, 
followed by other ctoses given at subsequent time intervals required to maintain and or reenforce the 
immune response, for example, at M months for a second dose, and if needed, a subsequent dose(s) after 
several rmmths. The dosage regimai will also, at least in part be detennined by the need of the individual 
25 and be dependent upon the judgment of the practitioner. 

In acfafition, the vaccine containing tine immunogenic HCV antigen{s) nriay bo administered in conjunction 
with oth^ unmunoiBgulatDry agents, for example, immune globulins. * 



30 Preparation of Antitxxjles Against HCV Epitopes 



The Bnmunogeruc polypeptides prepared as described above are used to produce antibodies, both 

polydonad and monoclonal, if polycbnal antibodies are desired, a selected mammal (e.g.. mouse, rabbit. 

goat horse, etc.) is immunized wifli an immunogenic polypeptide bearing an HCV epitope(s). Serum from 
35 the immunized animal is collected and treated according to known procedures. If serum containing 

polyclcral antibodies to an HCV efxtope contains antibodies to otiier antigens, the polyclonal antibodies cari 

be purifled by immunoaffinity chromatography. Techniques for producing and processing polyclonal^ 

antisera are known in the art, see for example, Mayer and Walker (1 987). 

Aftematively, polyclonal antibotfies may be isolated from a mammal which has been previously Infected 
40 with HCV. An example of a method for purifying antibodies to HCV epitopes from serum from an infected 

indivkti^ based upon affinity diromatography and utilizing a fusion polypeptide of SOD and a polypeptide 

encoded wttiin cONA done 5-1-1 , is presented in EPO Pub. No. 318,216. 

Monodonal antibodies directed against HCV epitopes can also t>e readily produced by one skilled in 

the art The general methodology for making monoclonal antibodies by hybridomas is well known. Immortal 
4S antibodyivodudng cell Bnes can be created by cell fosion. and also by other techniques such as direct 

transfonnnation of B lymphocytes vrith oncogenic ONA or transfection with Epstein-Barr virus. Sise, e.g.. M. 

Schr^^al. (1980); Hammerling et al. (1981): Kennettet al. (1980): see also. U.S. Patent Nos. 4.341,761; 

4.399,121; 4.427.7K; 4,444.8ai^;^4.466.917: 4,472.500; 4,491,632; and 4.483.890. Panels of monoclonal 

antibocfes produced against HCV epitopes can be screened for various properties; i.e., for isotype. epitope 
so affinity, etc. 

Anybodies. fcKifh monoclonal and polyclonal, which are directed against HCV epitopes are particularly 
useful in dlsgno^. and those v^ach are neutralizing are useful in passive immunotherapy. Monoclonal ^ 
antibCKfies, In particular, may be used to raise anti-idiotype antibodies. ^ ^ 

Anerkliotype antibodies are immunoglobulins which carry an "internal image" of the antigen of the 
65 infectious agent- 'again desired. See. for example. Nisonoff, A., et a^ (1981) and 

-f?^^^^ antibodies are known in tfie art See. for example. Girzych (1985), 

r :Ma^^ Uytdehaag etal. (1985). These anti-idiotype antibodies may also t>e useful for 
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treatment and/or diagnosis of NANBH, as well as for an elucidation of the immunogenic regions of HCV 
antigens. o 

It would also be recognized by one of ordinary skill in the art that a variety of types of antibodies 
directed against HCV epitopes may be produced. As used heriu, the terni "antibody" refers to a 
5 polypeptide or group of polypeptides which are comprised of at least one antibody combining site. An 
"antibody combining site" or "binding domain" is fbnmed from ttie folding of variable domains of an 
antibody motecule(s) to form three-dimensional binding sf^ces witiian internal surface shape and charge 
distribution complementary to the features of an epitope of an artigen, which allows an immunological 
reaction with the antigen. An antibody combining site msy be fonned from a heavy and/or a light chain 
10 domain (VH and VL, respectively), which form hypervariable loops vidiich contribute to antigen binding. The 
term "antibody" includes, for example, vertebrate antibodies, hybrid antaxulles, chimeric antibodies, altered 
antibodies, univalent antibodies, the Fab proteins, and single domain ant9>odies. 

A "single domain antibody" (dAb) is an antibody which is con^rised of an VH domain, which reacts 
immunologically with a designated antigen. A dAB does not contain a VL domain, but may contain other 
15 antigen binding domains known to exist in antibodies, for examjUe. the kappa and lambda domains. 
Methods for preparing dABs are known in the art See, for ^cample, Ward et al. (1989). . * 

Antibodies nriay also be comprised of VH and VL dwnains. as well as otiier known antigen binding ; 
domains. Examples of these types of antibodies and methods for their preparation are known in the art (see. 
e.g., U.S. Patent No. 4.816.467, which is incorporated herein by reference), and include the following. For 
20 example, "vertebrate antibodies" refers to antibodies which are tetraoiers or aggregates tiiereof. comprising 
light and heavy chains which are usually aggregated in a T* oonfigiralion and which may or may not have 
covalent linkages between the chains. In vertebrate antibotfies, the anino add sequences of all the chains 
of a particular antibody are homologous with the chains found in oneantibody produced by the lymphbcyte 
which produces tiiat antibody in situ, or in vitro (for ^mple. i) hyforidomas). Vertebrate antibodies 
25 typicallly include native antibodies, for example, purified palydonat antibodies and monoclonal antibodies. 
Examples of the methods for the preparation of these antibodies are de^bed infra 

"Hybrid antibodies" are antibodies wherein one pair of heavy andlight chains Is homologous to those in 
a first antibody, while tiie otiier pair of heavy and light chms is homologous to those in a different second 
antibody. Typically, each of these two pairs will bind diff^nt epitopes, particulariy on different antigens. 
30 This results in the property of "divalence". i.e.. the ability to bind twoantigens simultaneously^ Such hybrids 
may also be formed using chimeric chains, as set fortii below. 

"Chimeric antibodies", are antibodies in which the heavy and/or 6ght chains are fusion proteins. 
Typically the constant domain of the chains is from one particular species and/or class, and the variable 
domains are from a different species and/or class. Also ir^ded is any antibody in which either or botii of 
35 the heavy or light chains are composed of combinations of sequences mimicking the sequences irt 
antibodies of different sources, whether these sources be differing classes, or different species of origin, 
and whether or not the fusion point is at the variable/constant bouidsry. Thus, it is possible to produce 
antitxidies in which neither the constant nor the variable legion mimic known antibody sequences. It then 
becomes possible, for example, to construct antibodies whose variable region has a higher specific affinity 
40 for a particular antigen, or whose constant region can elicit enhanced complement fixation, or to make other 
improvements in properties possessed by a particular constant region 

Anottier example is "altered antibodies", which refers to antibody in which tiie naturally occurring 
amino acid sequence in a vertebrate antibody has been varied. Utilizing recombinant ONA techniques, 
antitx)dies can be redesigned to obtain desired characteristica The pos^bte variations are many, and range 
45 from the changing of one or more amino acids to the complete rede^gn of a region, for examgto* the 
constant region. Changes in the constant region, in general, to attaii desired cellular process characteris- . 
tics, e.g., changes in complement fixation, interaction with merTi)ranee. and otiier effector functions. 
Changes in the variable region may be made to alter antigen binding (^mraceristics. The antibody may alsio 
be engineered to aid tiie specific delivery of a molecule or substance to a specific cell or tissue site. The 
50 desired alterations may be made by known techniques in molecularbiotogy. e.g., recombinant techniques, 
site directed mutagenesis, etc. 

Yet another example are "univalent antibodies", whidi are aggregates comprised of a heavy chain/light 
chain dimer bound to the Fc (i.e., constant) region of a second heavy ch^. This type of antibody escapes. 
■ v. V/antigenIc modulation. See, e 

55 ; . Included also within the definition of antibodies are Tab" hragments of antibodies. The "Fab" region 

and light chains vtlxch are roughly equivalent, or analogous, to th% ^ 
•"A f which comprise tfie branch portion of tiie heavy and lightchains. and which have been shown to^ . ^ ' ' 

to a specified antigen, but which lack tie effector FcpCMrtion . "Fab" includes - 
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aggregates of one heavy and one light diain {c(MTimonly icnown as Fab'), as well as tetranriers containing 
the 2H and 2L chains (referred to as F(*)2). which are capable of selectively reacting with a designated 
antigen or antigen family. "Fab" antibodes may be divided into subsets analogous to those described 
above, i.e, "vertebrate Fab", "hybrid Fab', "chimeric Fab", and "altered Fab". Methods of producing "Fab" 
5 fragments of antibodies are 'known withisthe art and include, for example, proteolysis, and synthesis by 
recombinant techniques. 



ll.H. Diagnostic Oligonucleotide Probes and Kits 



10 




Using the disclosed portions of the isolated HCV cDNAs as a basis, oligomers of approximately 8 
nucleotides or more can be prepared, cfther by exclslDn or synthetically, which hybridize with the HCV 
genome and are useful In identification dthe viral agent{s). ftarther characterization of the viral genome(s). 
as well as in detection of the virus(es) indiseased individuals. The probes for HCV polynucleotides (natural 
75 or derived) are a length which allows the detection of unique viral sequences by hybridization. While 6-8 
nucleotides may be a workable length, sequences of 10-12 nucleotides are preferred, and about ^20 
nucleotides appears optimal. Prefers4)ly, liese sequences will derive from regions^whidi lack heterogeneity:* 
These probes can be prepared using routine methods, including automated oligonucleotide synthetic 
methods. Among useful probes, for exanple. are those derived from the newly isolated clones disclosed 
20 herein, as well as the various oligomers iseful in probing cDNA libraries, set forth t>elow. A complement to 
any unique portion of the HCV genome vil be satisfactory. For use as probes, complete complementarity is 
desirable, though it may be unnecessary is the length of the fragment is increased. 

For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, 
may be treated, if desired, to extract the wcleic adds contained therein. The resulting nucleic acid from the 
25 sample may be subjected to gel ^ectsphoresis or other size separation techniques; alternatively, the 
nucleic acid sample may be dot blotted without size separation. The probes are then labeled. Suitable 
labels, and methods for labeling probes are known in the art and Include, for example, radioactive labels 
incorporated by nick translation or kinasiig. biotin. fiuorescent probes, and chemiluminescent probes. The 
nucleic acids extracted from the samplB are then treated with the labeled probe under hybridization 
30 conditions of suitable stringencies, and polynucleotide duplexes containing the probe are detected. 

The probes can be made completely complementary to the HCV genome. Therefore, usually high 
stringency conditions are desirable in order to prevent false positives. However, conditions oi high 
stringency should only be used if the pmbes are complementary to regions of the viral genome which lack 
heterogeneity. The stringency of hybricfeation is detOTnined by a number of factors during hybridization 
35 and during the washing procedure, incluling temperature, ionic strength, length of time, and concentration 
of formamlde. These factors are outlinedii, for example. Maniatis. T. (1 982). 

Generally, it is expected that the HCV genome sequences will be present in serum of infected 
individuals at relatively low levels, i.e-, £l approximately 10^I(P chimp infectious doses (CID) per ml. This 
level may require that amplification technques be used in hybridization assays. Such techniques are known 
40 in the art. For example, the Enzo Biochemical Corporation "Bio-Bridge" system uses terminal deox- 
y nucleotide transferase to add unmodifEd 3'-polyKlT-4ails to a DNA probe. The poly dT-tailed probe is 
hybridized to the target nucleotide soiuence, and then to a biotin-modified poly-A. POT application 
84/03520 and EPA1 24221 describe a DItt hybridization assay in which: (1) analyte is ainealed to a single- 
stranded DNA probe that is complemeatary to an enzyme^abeled oligonucleotide; and (2) the resulting 
45 tailed duplex is hybridized to an enzymHabeled oligonucleotkJa EPA 204510 describes a DNA hybridiza- 
tion assay in which analyte DNA is contacted with a probe that has a tail, such as a poly-dT tail* an 
amplifier strand that has a sequence thathybridizes to the tail of the probe, such as a poly-A sequence, and 
which is capableof binding a pluraRty of labeled brands. A particularly desirable technique may first involve 
amplification of the target HCV sequences in sera approximately 10,000 fold. I.e.. to approxinriately '10^ ; 
50 sequences/ml. This may be accomplish^, for example, by the polymerase chain reactions (PGR) technique 
described which is by SaiW et al. (198^, by Mullis, U.S. Patent No. 4.683,195, and by Mullis et al. U.S. 
Patent No. 4.683.202. The amplified seqtience(s) may then be detected using a hybridi2Eation assay which is 
described in EP 317,077. published Hay 24, 1989. These hybridization assays, which should detect, 
sequences at the level of 10^ /ml, utilte nucleic add muftimers which bind to single-stranded analyte 
55 nucleic acid, and which also bind to a multiplicity of single-stranded labeled oligonucleotides. A suitafc)!e 
solution .phase sandwich assay which miy be used wth labeled polynucleotide probes, and the nfiethOd?.^ 
■ the. preparation of probes is described inEPO 225.807. publis .-r'^ -r^.:^^'"^^ ^ ^ 

' The probes can be packaged into degnostic kits. Diagnostic kits include the probe DNA, which^ nriay ^le^ 
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labeled; alternafively, the probe ONAmay be unlabeled and the ingredients for labeling may be included in 
the kit in separate contains. The kit may also contain other suitably packaged reagents and materials 
needed .for tie particular hybridization protocol, for example, standards, as wel| as instructions for 
conducting ttB test. -^ •" -■^ 



immunoassay and Diagnostic Kits 



Both the polypeptides which react immunologically with serum containing HCV antibodies, for example. 

10 those detected by the antigenic screerung mettiod described infra, in the Examples, as well those derived 
from or encoded within the isolated ckms described in the Examples, and composites thereof, and the 
antibodies raised against the HCV specific epitopes in tiiese polypeptides, are useful in immunoassays to 
detect presence of HCV antibodies, or the presence of the virus and/or viral antigens, in biological samples. 
Design of the immunoassays Is subject to a great deal of variation, and a variety of these are known in the 

15 art. For example, the immunoass^ may utilize one viral epitope; alternatively, the immunoassay may use a 
combination of viral epitopes derived from these sources; these epitopes may be derived from the same oP^ 
from different viral polypeptides, and may be in separate recombinant or natural polypeptides, or together ifi ' 
the sanrie recombinant potypeptkles. It may use. for example, a monoclonal antibodf^directed towards a . 
viral epitopefs), a combination of monoctonal antibodies directed towards epitopes of one viral antigeri. 

20 monoclonal antibodies directed towards epitopes of different viral antigens, polyclonal antibodies directed 
towards the same viral antigen, or polydonal antibodies directed towards different viral antigens. Protocols 
may be based, for example, upon oompemfon. or direct reaction, or sandwich type assays. Protocols may 
also, for example, use solid supports, or may be by immunoprecipitation. Most assays involve the^use of 
labeled antibody or polypeptkJe; the labete may be, for example, fluorescent, chemiluminescent. radioactive, 

25 or dye molecules. As^ys which amplify the signals from the probe are also known; examples of which are 
assays which utilize biotin and avidin. and enzyme-labeled and mediated immunoassays, such as ELISA 
assays. 

Some of the antigenic regions of the putative poiyprotein have been mapped and identified by 
screening the antigenicitiy of bacterial expression products of HCV cDNAs which encode portions of the 

30 poiyprotein. See the Bcamples. Other antigenic regions of NOV may be detected by expressing the portions 
of the HCV cDNAs in other expr^on systems, including yeast systems and cellular systems derived from 
insects and vertebrates. In addition, studies giving rise to an antigenicity index and 
hydrophobicl^/hydrophillclty profile give rise to information concerning the probability, of a region's 
antigenicity. - 

35 The stucfes on antigenic mapping by expression of HCV cDNAs showed that a number of clones 
containing these cDNAs expressed polyp^tides which were immunologically reactive with serum from 
individuals w&\ NANBH. No single polypeptide was immijnologically reactive with all sera. Five of these 
polypeptides were very immunogenic in that antibodies to the HCV epitopes in these polypeptides were 
detected in many different patiwit sera, although the overiap in detection was not complete. Thus, the 

40 results on the immunogenidty of the polypeptides encoded in the various cjgaesjsyggest that effecient 
detection systems may include the use of panels of epitopes. The epitopes in the panel may be 
constructed into one or multiple polypeptides. 

Kits suitsble for immunodiagnosis ard containing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the polypeptides of the invention containing HCV epitopes or 

45 antibodies cfitected against HCV epitopes in suitable containers, along with the remaining reagents and 
materials required for the conduct of the assay, as well as a suitable set of assay instructions. 



Further Characterization of ttie HCV Genome, Virions, and Viral Antigens Using Probes Derivecl T rc^ cDNA 
50 to tlie Viral Genome ' ^ r~-- 

The HCVcDNA sequence infonnafion in the newly isolated clones described in the Examples may be 
used to gain further Information on the sequence of the HCV genome, and for identification and isolation of 
the HCV agent, and thus will aki in its characterization including the nature of the genome, the strup^ of 
55 the viral particle, and the nature of the antigens of which it is composed. This inforrhation; ih^ferri; can lead 
to additional polynucleotide probes, polypeptides derived from the HCV genpij[iei^^|iQ^^dgg^^ 



against HCV epitopes which would be useful for the diagnosis and/or treatmerifpf HCV c^Hh *:»r«;"vj «- v. 
The cDNA sequence information in the abovementioned clones is useful for^the desigri*^!^^^ 
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- isolation of additional cDNA sequences which are derived from as yet undefined regions of the HCV 
genonie(s) from which the cDNAs in clones described herein and in EP 0,316,2lB are derived. For example, 
labeled protias containing a sequence of approxinnately 8 or more nucleotides, and preferably 20 or more 
nucleotides, which are derived from regions close to the s'-temnini or s'-termini of the composite HCV 

5 cDNA sequence shown in Fig. 17 may be used to isolate overlapping cDNA sequences from HCV cDNA 
libraries. Alternatively, characterization of the genomic segments could be from the viral genome(s) isolated 
from purified HCV particles. Methods for purifying HCV particles and for detecting them during the 
purification procedure are described herein, infra. Procedures for isolating polynucleotide genomes from 
viral particles are known in the art, and one procedure which may be used is that described In EP 

70 0,218.316. The isolated genomic segments could then be cloned and sequenced. An example of this 
technique, which utilizes amplification of the sequences to be cloned, is provided infra., and yielded done 
16jh. 

Methods for constructing cDNA libraries are known in the art, and are discussed supra and infra; a 
method for the construction of HCV cDNA libraries in Iambda-gt11 is discussed in EPO Pub. No. 318^16. 
75 However, cDNA libraries which are useful for screening with nucleic acid probes may also be constructed in 
other vectors known in the art, for example, lambda-gtiO (Huynh et al. (1985)). :.J%i{^, 



Screening for Anti-Viral Agents f or HCV 

20 

The availability of cell culture and animal model systems for HCV makes it possible to screen for anti- 
viral agents which inhibit HCV replication, and particulariy for those agents which preferentially allow cell 
growth and multiplication while inhibiting viral replication. These screening methods are known by those of 
skill in the art. Generally, the anti-viral agents are tested at a variety of concentrations,, for their effect on 

25 preventing viral replication in cell culture systems which support viral replication, and the/i for an inhibition 
of infeclivity or of viral pathogenicity (and a low level of toxicity) in an animal model system. 

The methods and compositions provided herein for detecting HCV antigens and HCV polynucleotides 
are useful for screening of anti-viral agents in that they provide an attemative, and perhaps more sensitive 
means, for detecting the agent's effect on viral replication than the cell plaque assay or IDso assay. For 

30 example, the HCV-polynucleotide probes described herein may be used to quantitate tiie amount of viral 
nucleic acid produced in a cell culhjre. This could be accomplished, for example, by hybridization or 
competition hybridization of tiie infected cell nucleic acids with a labeled HCV-polynucleotide probe. For 
example, also. anti-HCV antibodies may be used to identify and quantitate HCV antigen(s) in tiie cell culture 
utilizing the immunoassays described herein. In addition, since it may be desirable to quantitate HCV 

35 antigens in the infected cell culture by a competition assay, tiie polypeptides encoded within the HCV^ 
cDNAs described herein are useful in these competition assays. Generally, a recombinant HCV polypeptide 
derived from tiie HCV cONA would be labeled, and the inhibition of binding of this labeled polypeptide to an 
HCV poJypeptide due to tiie antigen produced in the cell culture system would be monitored. Moreover, 
these techniques are particularly useful in cases where the HCV may be able to replicate in a ce!! line 

40 witiiout cauang cell deatii. 

The anti-viral agents which may be tested for efficacy by these metfiods are known in tine art, and 
include, for example, those which interact with virion components and/or cellular components which are 
necess^ for ttie binding and/or replication of the virus. Typical anti-viral agents may include, for example, 
inhibitors of virion polymerase and/or protease(s) necessary for cleavage of tiie precursor polypeptides. 

45 Ottier anti-wral agents may include tiiose which act with nucleic acids to prevent viral replication, for 
example, anti-sense polynucleotides, etc. 

Antisense polynucleotides molecules are comprised of .a complementary nucleotide sequence which 
allows Ifiem to hytiridize specifically to 'designated regions of genomes or RNAs. Antisense polynucleotides 
may include, for example, molecules that will block protein translation by binding to mRNA. of may be 

50 moleades which prevent replication of viral RNA by transcriptase. Thefy may also^ include molecules virhich 
carry agents (non-cov^ently attached or covalentiy bound) which cause the^ yiral RNA to be inactive by 
causing, for example, scissions in tiie viral RNA. They may also bind to celluiar polynucleotides which , 
enhance and^or are required for viral infectivity. replicative ability, or chronicity. Antisense molecule^ which 
are to hybridize to HCV derived RNAs may be designed based upon the sequence information of ttie HCV 

55 cDNAs provided herein. The antiviral agents based upon anti-sense polynucleotides for HCV , may to; 
designed to bind wtii high specificity, to be of increaseA.$^jj^|ife 

Hence, they may be delivered in specialized systenris,' for e^hp^^ In 
addition, ttiey may include analogs, attached proteins, substituted! Qtfjaft^i:*^^^ 
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Other types of drugs may be based upon polynucleotides which "mimic*' important ccmtrof regions of 
the HCV genome, and which may be therapeutic due to their interactions with key components of the 
system responsible for viral infectivity or replication. 

5 

General Methods 

The general techniques used in extracting the genome from a virus, preparing and probing a cDNA 
library, sequencing clones, constructing expression vectors, transforming cells, performing immunological 
10 assays such as radioimmunoassays and ELISA assays, for growing ceils in culture, and the like are known 
in the art and laboratory manuals are available describing these techniques. However, as a general guide, 
the following sets forth some sources cunrently available for such procedures, and for materials useful in 
carrying them out. 

Both prokaryotic and eukaryotic host ceils may be used for expression of desired cocOng sequences 

75 when appropriate control sequences which are compatible with the designated host are used. Among 
prokaryotic hosts. E. coH is most frequently used. Expression control sequences for prokaryotes includ# . 
promoters, optionally containing operator portions, and ribosome binding sites. Transfer vectors compatible * 
with prokaryotic hosts are commonly derived from, for example. pBR322, a plasmid containing operons 
conferring ampicillin and tetracycline resistance, and the various pUC vectors, which also contain se-^ 

20 quences conferring antibiotic resistance markers. These markers may be used to obtain successful 
transtbrmants by selection. Commonly used prokaryotic control sequences include the Beta-lactam^e 
(penicillinase) and lactose promoter systems (Chang et al. (1977)}. the tryptophan (trp) promoter system 
(Goeddel et al. (1980)) and the lambda-derived Pl promoter and N gene ribosome binding site (Shimatake 
et al. (1981)) and the hybrid tec promoter (De Boer et al. (19^)) derived from sequences of the tip and lac 

25 UV5 promoters. The foregoing systems are particulariy compatible with E. coli; if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may.be used, with corresponding con&ol sequences. 

Eukaryotic hosts include yeast and mammalian cells In culture systems. Saccharomyces cerevisiae and 
Saccharomyces carisbergensis are the most commonly used yeast hosts, and are convenient fungal hosts. 
Yeast compatible vectors carry markers which permit selection of successful transformants by confening 

30 prototrophy to auxotrophic mutants or resistance to heavy metals on wild-type strains. Yeast compatible 
vectors may employ the 2 micron origin of replication (Broach et al. (1983)). the combination of CEN3 and 
ARS1 or other means for assuring replication, such as sequences which will result in incorporation of an 
appropriate fragment into the host cell genome. Control sequences for yeast vectors are known in the art 
and include promoters for tiie syntiiesis of glycolytic enzymes (Hess et al. (1968); Holland et al. (1978)), 

3S including the promoter for 3 phosphoglycerate kinase (Hitzeman (1980)). Tenninators may also be included, 
such as those derived from the enolase gene (Holland (1981)). Particularly useful control systems are those 
which comprise the glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol de- 
hydrogenase (ADH) regulatable promoter, terminators also derived from GAPDH, and if secretion is desired, 
leader sequence from yeast alpha factor. In addition, the transcriptional regulatory region and the transcrip- 

40 tlona! initiation region which are operably linked may be such that they are not naturally associated m the 
wild-type organism. These systms are described in detail in EPO 120.551. published October 3. 1984; 
EPO 116,201. published August 22, 1984; and EPO 164,556. published December 18. 1985. all of which are 
assigned to the herein assignee, and are hereby incorporated herein by reference. 

Mammalian cell lines available as hosts for expression are known in the art and include many 

45 immortalized cell lines available from tiie American Type Culture CoOecton (ATCC). inducfing HeLa ceils, 
Chinese hamster ovary (CHO) ceils, t>aby hamster kidney (BHK) cells, and a number of other cell lines. 
Suitable promoters for mammalian cells are also known in the art and include viral promoters sudi as that 
from Simian Virus 40 (SV40) (Rers (1978)), Rous sarcoma . vims (RSV). adenovirus (ADV), . and bovine 
papilloma vims (BPV). Mammalian cells may also require terminator sequences and poly A addition 

50 sequences; enhancer sequerrces which Increase expression may also be included, am! sequences which 
cause amplification of the gene may also be desirable. These sequences are known In the art. Vectors 
suitable for replication in mammalian cells may Include viral repllcons. or sequences which insure 
integration of tiie appropriate sequences encoding NANBV epitopes into the host genome. 

Transformation may be by any known method for introducing polynucleotides into a host cell, iricluding, ; 

55 for example packaging the polynucleotide in a virus and tiransdudrig a host cell witii the virus, and by direct . 
uptake of the polvnucleotide. The.transformationj^pr o p ^ ^ to be transfaormed. 

For example, transformation oif the E. poll fibsr^^ sequences is 

discussed in tiie Example section, infra. iBact^Hai) tran^^^^ employs- 
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treatment with calcium or rubidium chloride (Cohen (1972); Maniatis (1982)). Yeast transformation by direct 
uptake may be carried out using the method of Hinnen et al. (1978). Mammalian transformations by direct 
uptake may be conducted using the calcium phosphate precipitation method of Graham and Van der Eb 
(1978), or the various known modifications thereof. 

5 Vector construction employs techniques which are known in the art. Site-specific ONA cteavage is 
performed by treating with suitable restriction enzymes under conditions which generally are specified by 
the manufacturer of those commercially available enzymes. In general, about 1 microgram of plasmid or 
DNA sequence is cleaved by 1 unit of enzyme in about 20 microliters buffer solution by incubation of 1-2 hr 
at 37* C. After incubation with the restriction enzyme, protein is removed by phenol/chlorofonfn extraction 

70 and the DNA recovered by precipitation with ethanol. The cleaved fragments may be separated using 
polyacrylamide or agarose gel electrophoresis techniques, according to the general procedures found in 
Methods in Enzymology (1980) 65:499-560. 

Sticky ended cleavage fragments may be blunt ended using E. coll DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide triphosphates (dNTPs) present in the mixture. Treatment with 

7S S1 nuclease may also be used, resulfih'lf^il |he hydrolysis of any single stranded DNA portions. 

Ligations are carried out using standard buffer and tennperature conditions using T4 DNA Jigase arid 
ATP; sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments' 
are used as part of a ligation mixture, the vector fragment is often treated with bacterial alkaline 
phosphatase (BAP) or calf intestinal alkaline phosphatase to remove the s'-phosphate and thus prevent 

20 religation of the vector; alternatively, restriction enzyme digestion of unwanted fragments can be used to 
prevent ligation. 

Ligation mixtures are transformed into suitable cloning hosts, such as E. co!r» and succe^ful transfor- 
mants selected by, for example, antibiotic resistance, and screened for the correct construction. 

Synthetic oligonucleotides may be prepared using an automated oligonucleotide syntfiesizer as de- 

25 scribed by Warner (1984). If desired the synthetic strands may be labeled, wjdi ^^P by treatment with 
polynucleotide kinase in the presence of ^^p-atp, using standard conditions for the reaction, . ' 

DNA sequences, including those isolated from cDNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as described by Zoller (1982). Briefly, the DNA to be 
modified is packaged into phage as a single stranded sequence, and converted to a double stranded DNA 

30 with DNA polymerase using, as a primer, a synthetic oligonucleotide complementary to the portion of the 
DNA to be modified, and having the desired modification included in its own sequence. The resulting 
double stranded DNA is transfbnrned into a phage supporting host bacterium. Cultures of the transformed 
bacteria, which contain replications of each strand of the phage, are plated In agar to obtain plaques. 
Theoretically, 50% of the new plaques contain phage having the mutated sequence, and the remaining 50% 

35 have the original sequence. Replicates of the plaques are hybridized to labeled synthetic probe at 
temperatures and conditions which permit hybridization with the correct strand, but not with the unmodified 
sequence. The sequences which have been identified by hybridization are recovered and cloned. 

DNA libraries may be probed using the procedure of Grunstein and Hogness (1975). Briefly, in this 
procedure, the DNA to be probed is immobilized on nitrocellulose filters, denatured, and prehybridized with 

40 a buffer containing 0-50% fbrmamide. 0.75 M NaCI, 75 mM Na citrate. 0.02% (wt/v) each of bovine serum 
albumin, polyvinyl pyroliidone. and Ficoll, 50 mM Na Phosphate (pH 6.5), 0.1% SDS, and 100 
micrograms/ml carrier denatured DNA. The percentage of formamide in the buffer, as well as the time and 
temperature conditions of the prehybridization and subsequent hybridization steps depends on the strin- 
gency required. Ollgomeric probes which require lower stringency conditions are generally used witii low 

45 percentages of formamide. lower temperatures, and longer hybridization times. Profc)es containing more thian 
30 or 40 nucleotides such as those derived from cDNA or genomic sequences generally employ higher 
( temperatures, e.g.. about 40-42 *C. and a high percentage, e.g., 50%. formamide. Following prehybridiza- 
tion, s'-^^P-labeled oligonucleotide probe Is added to the buffer; and the filters are incubated in this mixtur^: 
under hybridization conditions. After washing, the trisated filters are subjected to autoradiography to show 

50 the location of the hybridized probe; DNA in corresponding locations on the original agar plates is used as 
the source of the desired DNA. 

For routine vector constructions, ligation mixtures are transformed into E. coli strain HB101 or pother 
suitable host, and successful transformants selected by antibiotic resistance~or other maric^«^PIasmlds 
from the transfomnants are then prepared according to the method of Clewell et al. (19(69), usually following 

55 chloramphenicol amplification (Clewell (1972)). The DNA is isolated and analyzed, usually by restriction/ 
enzyme analysis and/p5vS,$q5ij|pg|^^ nrtethod of Sanger et aL (1977) , as , 

further described by Messing ;et ai.\(t (1980). Problems with band- 

compression, which are srimetirnesj obsgr^^ overcome by use of T- 
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deazoguanosine according to Barr et al. (1986). 

The enzyme-linked immunosorbent assay (ELISA) can be used to measure either antigesor antibody 
concentrations. This metliod depends upon conjugation of an enzyme to either an antigen oran antibody* 
and uses the bound enzyme activity as a quantitative label. To measure antibody, the Icnovn antigen is 

5 fixed to a solid phase (6.g.. a microplate or plastic cup)« incubated with test serum dilutions, washed. 
Incubated with anti-immunoglobuirn labeled with an enzyme, and washed again. Enzymes suitable for 
labeling are known in the art, and include, for example, horseradish peroxidase. Enzyme actiity bound to 
the solid phase is measured by adding the specific substrate, and detenmining product formation or 
substrate utilization colorimetrically. The enzyme activity bound is a direct function of the amunt of anti- 

70 body bound. 

To measure antigen, a known specific antibody Is fixed to the solid phase, the test materel containing 
antigen is added, after an incubation the solid phase is washed, and a second enzyme-labeled antittody is 
added. After washing, substrate is added, and enzyme activity is estimated colorimetrically, ad related to 
antigen concentration. 

15 . ^ 

Examples ' 'v' ^ ;. ^ 



20 Described below are examples of the present invention which are provided only for illustrate purposes, 
and not to limit the scope of the present invention. In tight of the present disclosun. numerous 
embodiments within the scope of the claims will be apparent to those of ordinary skill In tite art 



25 Isolation and Sequence of Overlapping HCV cDNA Clones 13i. 26j, CA59a, CA84a. CA1569 andCAISTb 



The clones 13i, 26j. CA59a, CA84a. CA156e and CA167b were isolated from the lannbdhgtll library 
which contains HCV cDNA (ATCC No. 40394), the preparation of which is described in BO Pub. No. 
30 318.216 (published 31 May 1989). and WO 89/04669 (published 1 June 1989). Screening of tie library was 
with the probes described infra, using tiie method described in Huynh (1985). The frequenciis with which 
positive clones appeared with the respective probes was about 1 in 50.000. 

The isolation of clone 13i was accomplished using a synthetic probe derived from the sequence of 
clone 12f. The sequence of tiie probe was: 
35 5 GAA CQT TGC GAT CTG GAA GAG AGG GAG AGG 3'. 

The isolation of clone 26j was accomplished using a probe derived from the 5 -region d clone K9-1. 
The sequence of the probe was: 

5 TAT CAG TTA TGC GAA CGG AAG CGG CCC CGA 3. 

The isolation procedures for clone 12f and for clone k9-1 (also called K9-1) are describedin EPO Pub. 

40 No. 318,216. and their sequences are shown in Figs. 1 and 2, respectively. The HCV cDNAoquences of 
clones 131 and 26], are shown in Rgs. 4 and 5. respectively. Also shown are the amino a^s encod^ 
therein, as well as tiie overlap of done 13i with clone 12f. and the overlap of clone 26j with done 131. The 
sequences for these clones confirmed the sequence of clone K9-1. Clone K9-1 had been isilated from a 
different HCV cDNA library (See EP 0,218,316). 

45 Clone CA59a was isolated utilizing a probe based upon the sequence of the 5,-fegion of done 26j. The 
sequence of this protte was: 

5 CTG QTT AQC AGG GCT TTT CTA TCA CCA CAA 3'. 

A prol>e deriv^ed^from the sequence^of clone CA59a was used to isolate clone CA84a. Thesequence of 
the probe used for this isolation was: 
50 5 AAGGTCCTGGTAGTGCTQCTGCTA TTTGCC3. 

Clone CAl56e was isolated using a probe derived from tiie sequence of clone CA84a. Thesequence.of 
the probe was: 

5' ACT GGA CGA CGC AAG GTT GCA ATT OCT CTA 3'. 

Ctone CA167b was isolated using a probe derived from the sequence of clone CA ISSe. ]he sequence: 
55 of the probe was: 

^^ TTpjsm^^ : : ' . : 

The nuclkttifle^^i^ei^^^^^ lijdV cDN As in clones GA59a, CAB4a. CA156e. andCA167b', are 
shown Rgs. 6,;.7; Sjrztf^^^ amino acids encoded tfierein, as well as tiie oveiiap with the 
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sequences of relevant clones, are also shown in the Rgs. 

Creation of "pi" HCV cONA Library 

5 

A library of HCV cDNA. the "pi" library, was constructed from the sane batch of infectious chimpanzee 
plasma used to construct the Iambda-gt11 HCV cDNA library (ATCC No. 45394) described in EPO Pub. No. 
318.216, and utilizing essentially the same techniques. However, oonstnctlon of the pi library utilized a 
TO primer-extension method, in which the primer for reverse transcriptase waibased on the sequence of clone 
CA59A. The sequence of the primer was; 

5' GGT GAC GTG GGT TTC 3'. ' 



15 Isolation and Sequence of Clone pi14a 

r •^■-'^ 

Screening of the "pi" HCV cDNA library described supra., with the prd)e used to isolate done CAIB/^b 
(See supra.) yielded clone pi14a. The clone contains about ^0 base pain of cDNA which overlaps clones 
20 CA167b. CA156e, CA84a and CA59a, which were isolated from the lambdgt-n HCV cDNA Bbrary (ATCC 
No. 40394). In addition, pi14a also contains about 250 base pairs of DNAwhich are upstresmn of the HCV 
cDNA in clone CA167b. 



25 Isolation and Sequence of Clones CA216a. CA29Qa aniag30a 

Based on the sequence of clone CA167b a synthetic probe was madebaving the followmg sequence: 
5' GGC TTT ACC ACG TCA CCA ATG ATT QCC CTA 3 
30 The above probe was used to screen the lambda-gtll library (ATCC «o. 40394). which yielded; clone 
CA2iea, whose HCV sequences are shown in Rg. 10. 

Another probe was made based on the sequence of done CA2iea hawig the following sequence: 
5 TTT GGG TAA GGT CAT CGA TAC CCT TAC GTG 3 

Screening the lambda-gtll library (ATCC No. 40394) with this probe yielded done GA^a, the HCV 
35 sequences therein being sliown in Fig. 11. • . 

In a parallel approach, a primer-extension cDNA library was made ustig nucleic acid extracted from the 

same infectious plasma used in the original lambda-gtll cDNA library cfescribed above. The primer used 

was based on the sequence of clones CA2l6a and CA290a: 

5' GAA GGC GCA CGT AAG 3' 
40 The cDNA library was made using methods similar to those described peviously for tibr^ies used in the 

isolation of clones pi14a and k9-1. The probe used to screen this libraii was based on the sequence of 

done CA290a: 

5 CCG GCG TAG GTC GCG CAA TTT GGG TAA 3' 

Clone ag30a was isolated from the new library with the atx>ve probe, andcontained about 670 basepairs of 
45 HCV sequence. See Fig. 12. Part of this sequence overlaps the HCV sequence of clones CA216a and 
CA290a. About 300 base-pairs of the agSOa sequence, however, is upsteam of the sequence from done 
CA290a. The non-overlapping sequence shows a start codon (*) and stopcodons that may incficate the start 
of the HCV ORE. Also indicated in Fig. 12 are putative small encoded peptides (#) which may ptay^a role in 
regulating translation, as well as^Sie putative first amino acid of the putativfi polypeptide (/). and downstream 
50 amino acids encoded therein. 



Isolation and Sequence of Clone CA2Q5a 

^^^^^^^^^ |iL?4po!a]te^i original lambda gt-11 library (ATCC No. 40394), a synthetic 

^^(OjB§^p"n^^ in clone CA290a (Rg. 11). The segience of the probe Wa«. ; 

VS^ScA^i^^ A6T TTA CTT GTT GCC 3 . \ j 
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The sequence of the HCV cDMA on CA205a, shown in Rg. 13, overlaps wrth the cDNA sequences in both 
clones ag30a and CA290a. The overlap of the sequence with that of CA290a is shown by the dotted line 
above the sequence (the figure also shows the putative amino acids encoded in this fragn^ent). 

As observed from the HCV cONA sequences In clones CA205a and ag3Qa, the putative HCV 
5 polyprotein appears to begin at the ATG start codon; the HCV sequences in both clones contain an in- 
frame, contiguous double stop codon (TGATAG) for^ two nucleotides upstream from this ATG. ThS HCV 
ORF appears to begin after these stop codons. and to extend for at feast 8907 nucleotides (See the 
composite HCV cDN A shown in Rg. 17). * — 



10 



Isolation and Sequence of Clone 18g 



Based on the sequence of clone agSOa (See Fjg. 12) and of an overtepping clone from the original 
T5 lambda gt-11 library (ATCC No. 40394), CA230a. a synthetic probe was made having the following 
sequence: ^ 

5' CCA TAG TGG TCT GCG GAA CCG GTG AGT ACA 3 . ' * 

Screening of the original lambda-gtll HCV cDNA libary with the probe yielded clone 18g, the HCV cDNA 
sequence of which is shown in Rg. 14. Also ^owitin the figure are the overlap with clone agSOa. and 
20 putative polypeptides encoded withm the HCV cDNA. 

The cDNA in clone 18g (C18g or I8g) overlaps that in clones ag3(te and CA205a. described supra. The 
sequence of CIBg also contains the double stop cddon region obsen^ed in done agSOa. The polynucleotide 
region upstream of these stop codons presumably represents part of the 5 -region of the HCV genome, 
which may contain short ORFs, and which can be oonfirmed by direct sequenc ing of the purified HCV 
2S genome. These putative small encoded peptides mairplay a regulatory role in translation. The region of the 
HCV genome upstream of that represented by CISgcan be isolated few sequence analysis using essentially 
the technique described in EPO Pub. No. 318^16 Ibr isolating cDMA ^uences upstream of the HCV 
cDNA sequence in clone I2f. Essentially, small synlietic oligonucleotide primers of reverse transcriptase, 
which are based upon the sequence of Ct^, are synthesized and used to bind to the corresponding 
30 sequence in HCV genomic RNA. The primer sequences are proximal Id the known 5 -terminal of C18g. but 
sufficientiy downstream to allow tiie design of probe sequences upstream of the primer sequences. Known 
standard methods of priming and clonlr^ ar eused. The resulting cDNA libraries are screened with 
sequences upstream of tiie priming sites (as deduced from the elucidated sequence of Cidg). The HCV 
genomic RNA is obtained from either plasma or liver samples from Individuals with NANBH. Since HCV 
35 appears to be a plavi-like virus, the s'-termfnus of the genome may t>e modified with a "cap" structure. It is 
known that Flavivlrus genomes contain s'-termlnal "cap" stmctures. (Yelknv Fever virus, Rice et al. (1988); 
Dengue virus, Hahn et al (1988); Japanese EncephalKs Virus (1987)). 

iso^^tio" Sequence of Clones Irom the beta-HCV cDNA library 

Clones containing cONA representative of the 3'-termlnal region of the HCV genome were Isolated from 
a cDNA library constructed from the original infectious chimpanzee plasma pool which was used for the 
45 creation of the HCV cDNA lambda-gtll library (ATCC No. 40394), descrft)ed in EPO Pub. No; 318.216. In 
order to create tiie DNA library. RNA extifacted front the plasma was "taUed" witii poly rA using poly (rA) 
polymerase, and cDNA was synthesized using oligo(dT)i2-i8 as a prnner for reverse transcriptase. The 
^resulting RNA:cDNA hybrid was digested vnth RNAase H, and converted to double stranded HCV cDNA. 
The resulting HCV cDNA was cloned into lambda-gtIO, using essentially tfie technique described In Huynh 
50>^K1985).. yieW beta {or b) HCV cDNA library. The procedures used were as follows. 

An aliquot (I2mt) of the plasma was treated witii proteinase and extracted with an equal volume of 
phendr saturated with 0.05M Tris-CI, pH 7.5, 0.(15% (v/v) beta-mercaptoethanol. 0.1% (w/v) hydrox- 
yquinolone. 1 mM EDTA. The reciting aqueous phase was re-extracted %tnth the. phenol mixture; followed 
c by 3 extractions with a 1:1 mixture containing phendand chloroformdsoamyl alcohol (24:1), followed by 2 
^55 -extractions with a mixture of chloroform and isoamyl alcohol (1:1). Subsequent to adjustment of the aqueous . 
^^iq^^Q^jp^^, w respect to NaCI, nucleic acids in the aqueous phase were precipitated pvernjghta^^ 
^CrWith ,2.5 voi^ of cold absolute ethanoLThe precipitates were collected by cenh-iftga^bh^^^^^ 
^^tr^^^^f^^^ for 40 min., washed with 70% ethanol containing 20 mM NaCl. and with 100%- cold ethanol, 
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dried for 5 min. in a dssstcator. and dissolved in water. 

The isolated nucleic acids from the infectious chimpanzee plasma pool were tailed with poly rA utilizing 
poly-A polymerase &i the presence of human frfacenta ribonuclease Inhibitor (HPRI) (purchased from 
Amersham Corp.), ulrfng MS2 RNA as carrier, isolated nucleic acids equivalent to that in 2 ml of plasma 

5 were incubated in a solution containing TWIN (50 mM Tris HCI. pH 7.9. 10 mM MgCiz, 250 mM NaCI, 2.5 
mM MnQa, 2 mM dlhiothreltol (DTT)). 40 micromolar alpha-pP] ATP, 20 units HPRI (Amersham^ Corp.). 
and about 9 to 10 urats of RNase free poly-A polymerase (BRL). Incubation was for 10 min. at 37* C. and 
the reactions were stopped with EDTA (final concentration about 250 mM). The solution was extracted with 
an equal volume of phenol-chloroform, and with an equal volume of chloroform, and nucleic acids were 

to precipitated ovemighlat -20* C with 2.5 volumes of elhanol in the presence of 200 mM NaCI. 



Isolation of Clone bSa 

The beta HCV d3NA Hbt'ary was screened by hybridization using a synthetic probe, which had a- 
sequ^ce based upon the HCV cDNA sequence in clone 15e. The isolation of clone 15e is described in : . - ^ 
EPO Pub, No. 318,216. and its sequence is shown in Fig. 3. The sequence of the synthetic probe was: ; 
5 ATT GCG AGA TC[ ACG GGG CCT GCT ACT CCA 3'. 

20 Screening of the library yielded clone beta-5a (bSa). which contains an HCV cDNA region of approximately 
1000 base paire. Tte 5 -region of this cDNA overlaps clones 35f. 19g. 26g, and 15e (these clones are 
described supra). The region between the s'-tenmrnal poly-A sequence and the 3'-sequence which overlaps 
clone 15e. contains ipproximately 200 base pairs. This done allows the identification of a region of the 3 - 
terminal sequence the HCV genome. 

25 The sequence d b5a is contained within the sequence of the HCV cDNA In clone 16jh (described infra). 
Moreover, the sequence is also present in CC34a, isolated from the original lambda-gtll library (ATCC No. 
40394), (The original Iambda-gt11 library is referred to herein as the "C" library). * 

30 Isolation and Sequence of Clones Generated by PCR Amplification of the 3'-Reglon of the HCV Genome 



Multiple cDNA dones have been generated which contain nucleotide sequences derived from the 3- 
reglon of the HCV genome. This was accompfished by amplifying a targeted region of the genome by a 

35 polymerase chain reaction technique described in Saiki et ai. (1986), anc^in Saiki et al. (1988). which was 
modified as described below. The HCV RNA which was amplified was obtained from the original infectious 
chimpanzee plasma pool which was used for the creation of the HCV cDNA lambda-gtll library (ATCC No. 
40394) described in EPO Pub. No. 318.216. Isolation of the HCV RNA was as described supra. The isolated 
RNA was tailed at the 3'-end with ATP by E coli poly-A polymerase as described in Sippel (1973), except 

40 tiiat the nucleic acids isolated from chimp serum were substituted for the nucleic acid substrate. The tailed 
RNA was then reverse transcribed into cDNA by reverse transcriptase, using an oligo dT-primer adapter, 
essentially as described by Han (1987), except that the components and sequence of the primer-adapter 
were: 



4S 



Stuffer 


Noti 


SP6 Promoter 


Primer 


AATTC 


QCGGCCGC 


CATACGATTTAGGTGACACTATAGAA 


Tl5 



so 



The resultant cDNAwas subjected to ampUficafion by PCR using two primers: 



65 



Primer 


Sequence 


JH32 (30mer) 
JH11 (20mer) 


ATAGCGGCCGCCCTCGATTGCQAGATCTAC . 
AATTCG6GCGGCCGCCATACGA 
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The JH32 primer contained 20 nucleotide sequences hybridizable to the s'-end of the target region in the 
cDNA. with an estimated Tm of 66' C, The JH11 was derived from a portion of the ollgo dT-primer adapter; 
thus, it is specific to the 3 -end of the cDNA with a Tm of 64* C. Both primers were designed to have a 
recognition site for the restriction enzyme. Not!, at the s'-end. for use in subsequent cloning of the amplified 
5 HCV cDNA. 

The PGR reaction was earned out by suspending the cDNA and the primers in 1(K) microliters of 
reaction mixture containing the four deoxynucleoside triphosphates, buffer salts and metal ions, and a 
thenmostable DNA polymerase isolated from Thermus aquaticus (Taq polymerase), which are in a Perkin 
Elmer Cetus PCR kit (N801-0043 or N801-0055). The PCR reaction was performed for 35 cycles in a Perkin 

10 Elmer Cetus DNA thermal cycler. Each cycle consisted of a 1.5 min denaturation step at 94* C, an 
annealing step at 60* C for 2 min, and a primer extension step at 72* C for 3 min. The PCR products were 
subjected to Soutiiem blot analysis using a 30 nucleotide probe. JM34. the sequence of which was based 
upon that of the 3 -terminal region of clone 15e. The secjuence of JH34 is: 
5 CTT GAT CTA CCT CCA ATC ATT CAA AGA CTC 3 . 

75 The PCR products detected by the HCV cDNA probe ranged in size from about 50 to about 400 base pairs. , 
In order to clone the amplified HCV cDNA. the PCR products were cleaved with NotI and size selected 
by polyacrylamide gel electrophoresis. DNA larger than 300 base pairs was cloned into the NotI- site- of, 
pUCIBS The vector pUCIBS is constructed by including a NotI polylinker cloned between the EcoRI arid . 
Sail sites of pUCIB. The clones were screened for HCV cONA using the JH34 probe. A number of positive 

20 clones were obtained and sequenced. The nucleotide sequence of the HCV cDNA insert in one of these 
clones, lejh, and the amino acids encoded therein, are shown in Rg. 15. A nucleotide heterogeneity, 
detected in the sequence of the HCV cDNA in clone 16jh as compared to another clone of this.fegion. is 
indicated in the figure. 

25 

Compiled HCV cDNA Sequences 



An HCV cDNA sequence has been compiled from a series of overlapping clones derived from the 

30 various HCV cDNA libraries described supra.. In this sequence, the compiled HCV cDNA sequence 
obtained from clones b114a. 18g, ag30a, CA205a, CA290a. CA216a, pi14a, CA167b. CA156e, CA84a. and 
CA59a is upstream of the compiled HCV cDNA sequence published in EPO Pub. No. 318.216, which is 
shown in Rg. 16. The compiled HCV cDNA sequence obtained from clones fa5a and 16jh downstream of 
the compiled HCV cONA sequence published in EPO Pub. No. 318.216. ^ 

35 Fig. 17 shows the compiled HCV cDNA sequence derived from the above-descrit>ed xlones and th§ 
compiled HCV cDNA sequence published in EPO Pub. No. 318,216. The clones from which the sequence 
was derived are b114a. 18g. ag30a. CA205a. CA290a, CA216a, pl14a. CA167b. CA156e. CA84a. CA59a. 
K&.1 (also called k9-1).26j. 13i, 12f, 141. lib. 7f. 7e, 8h. 33c. 40b. 37b. 35, 36. 81. 32. 33b. 25c. 14c, 8f. 33f. 
33g, 39c, 35f. I9g. 26g. 15e. b5a. and 16jh. In the figure the three dashes above the sequence indicate the 

40 position of the putative initiator methionine codon. 

Clone b114a was obtained using the cloning procedure described for clone b5a, supra, except that the 
probe was the synthetic probe used to detect clone 18g. supra. Clone b114a overiaps with clones 1 89. 
agSOa. and CA205a. except that clone bl I4a contains an extra two nucleotides upstream of the sequence in 
clone 18g (i.e., 5'-CA). These extra two nucleotides have been included in the HCV genomic sequence 

45 shown in Rg. 17. 

It should be noted that although several of the clones described supra, have been obtained from 
libraries other than the original HCV cDNA lambda-gtll C library (ATCC No. 40394). these clones contain 
HCV cDNA sequences which overlap HCV cDNA sequences in^the original library. Thus, essentially all of 
the HCV sequence is derivable from the original iambda-gt11 C library (ATCc'TNtoi 40394) wKfch was used 
so to isolate the first HCV cDNA clone (5-1-1). The isolation of clone 5-1-1. Is described in EPO Pub. .No; 
318.216, ^ 



Purification of Fusion polypeptide ClOO-3 (Alternate meth6d) 6 ; 

The fusion polypeptide, CI 00-3 (also called HCV ciofci anp'^^ai^|^^j^|^ . 
superoxide dismutase (SOD) at the N-terminus an in-frame iC1C(6 j?IG*% 
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method for preparing the polypeptide by expression in yeast, and differential extraction of the insoluble 
fraction of the extracted host yeast cells, is described in EPO Pub. No. 318^16. An alternative method for 
the preparation of this fusion polypeptide is described below. In this method the antigen is precipitated from 
the crude cell lysate with acetone; the acetone precipitated antigen is then subjected to ion-exchange 

5 chromatography, and further purified by gel filtration. 

The hJSion polypeptide, CI 00-3 (HCV c100-3), is expressed In yeast strain JSC 308 (ATCC No. 20879) 
transformed with pAB24C100-3 (ATCC No. 67976); the transformed yeast are grown under conditions which 
allow expression (i.e., by growth in YEP containing 1% glucose). (See EPO Pub. No. 318,216). A cell lysate 
is prepared by suspending the cells in Buffer A (20 mM Tris HCI. pH 8.0, 1 mM EDTA, 1 mM PMSF. The 

10 cells are brdcen by grinding with glass beads in a Dynomiil type homogenizer or its equivalent. The extent 
of cell breakage is monitored by counting cells under a microscope with phase optics. Broken cells appear 
dark, while viable cells are light-colored. The percentage of broken cells is detennined. 

When the percentage of broken cells is approximately 90% or greater, the broken cell debris is 
separated from the glass beads by centrifugation. and the glass beads are washed with Buffer A. After 

15 combining the washes and homogenate. the insoluble material in the lysate is obtained by centrifugation. 
The material in the pellet is washed to remove soluble proteins by suspension in Buffer B (50 mM ^lypine, 
pH 12.0, 1 mM DTT. 500 mM NaCI), followed by Buffer C (50 mM glycine, pH 10.0. 1;mM DTi>- T^^^^^ 
insoluble material is recovered by centrifugation. and solubilized by suspension in Buffer C containing SDjS: 
The extract solution may be heated in the presence of t^eta-mercaptoethanol and concentrated by 

20 ultrafiltration. The HCV cl00-3 in the extract is precipitated with cold acetone, tf desired, the precipitate may 
be stored at temperatures at about or below -1 5 ' C. 

Prior to ion exchange chromatography, the acetone precipitated material is recovered by centrifugation. 
and may be dried under nitrogen. The precipitate is suspended In Buffer D (50 mM glycine. pH 10.0. 1 mM 
DTT, 7 M urea), and centrifuged to pellet insoluble material. The supernatant material is applied to an anion 

25 exchange column previously equilibrated with Buffer D. Fractions are collected and analyzed by ultraviolet 
absorbance or gel electrophoresis on SDS polyacrylamide gels. Tliose fractions containing the HCV c100-3 
polypeptide are pooled. 

In order to purify the HCV c100-3 polypeptide by gel filtration, the pooled fractions from the ion- 
exchange column are heated in the presence of beta-mercaptoethanol and SDS. and the eluate is 

30 concentrated by ultrafiltration. The concentrate is applied to a gel filtration column previously equilibrated 
with Buffer E (20 mM Tris HCI. pH 7.0, 1 mM DTT, 0.1% SDS). The presence of HCV c100-3 in the eluted 
fractions, as well as the presence of impurities, are determined by gel electrophoresis on polyacrylamide 
gels in the presence of SDS and visualization of the polypeptides. Those fractions containing purified HCV 
clOO-3 are pooled. Fractions high in HCV c100-3 may be further purified by repeating the gel filtration 

35 process. If the removal of particulate material is desired, the HCV c10D-3 containing material may be filtered 
through a 0.22 micron filter. 

Expression and Antigenicity of Polypeptides Encoded in HCV cDNA 

40 



Polypeptides Expressed in E. coli 

46 

The polypeptides encoded in a number of HCV cDNAs which the HCV genomic ORF were 
expressed in E. coli, and tested for their antigenicity using serum obtained from a variety of individuals with 
NANBH. The"expmssion vectors containing the cloned HCV cDNAs were constructed from pSODcfl 
(Steimer et al. (1986). In order to be certain that a correct reading frame would be achieved, three separate 
so expression vectors, pcflAB. pcflCD. and pcflEf were crejated by ligaling either. of three linkers, AB, CD. 
and EF to a BamHI-EcoRI fragment derived by digesting to completion the vector pSODcfl with EcpRI and 
BamHI. followed by treatment with alkaline phosphatase. The linkers were created from six oligomers. A, B, ; 
C. D, E. and F. Each oligomer was phosphorylated by treatnrwnt with kinase in the presence of ATP prior to 
annealing to its complementary oligomer. The sequences of the synthetic linkers were.the following: 



Name 



EP 0 388 232 A1 
DNA Sequence (5' to 3') 



A GATC CTG AAT TCC TGA TAA 

B GAG TTA AGG ACT ATT TTA A 

C GATC CGA ATT CTG TGA TAA 

D GCT TAA GAC ACT ATT TTA A 

E GATC CTG GAA TTC TGA TAA 

F GAC CTT AAG ACT ATT TTA A 

Each of the three linkers destroys the original EcoRI site, and creates a new EcoRI site within the linker.'but 
within a different reading frame. Hence, the HCV cDNA EcoRI fragments isolated from the clones when 
inserted into the expression vector, were in three different reading frames. 

The HCV cDNA fragments in the designated (ambda-gtl 1 clones were excised by digestion with EcoRI; 
each fragment was inserted into pcflAB. pcflCD. and pcflEF. These expression constructs were then 
transfonmed into D1210 E. coli cells, the transformants were cloned, and recombinant bacteria from each 
clone were induced to express the fusion polypeptides by growing the bacteria in the presence of IPTG. 

Expression products of the indicated HCV cDNAs were tested for antigenicity by direct immunological 
screening of the colonies, using a modification of the method described in Helfman et al. (1983). Briefly, as 
shown in Fig. 18. the bacteria were plated onto nitrocellulose filters overlaid on ampldllin plates to give 
approximately 1,000 colonies per filter. Colonies were replica plated onto nitrocellulose filters, and the 
replicas were regrown overnight in the presence of 2 mM IPTG and ampicillin. The bacterial colonies were 
lysed by suspending the nitrocellulose filters for about 16 to 20 min In an atmosphere saturated with CHCb 
vapor. Each filter then was placed in an individual 100 mm Petri dish containing 10 ml of 50 mM Tris HCl, 
pH 7,5. 150 mM NaCI. 5 mM MgClg. 3% (w/v) BSA. 40 micrograms/ml lysozyme. and 0.1 microgram/ml 
DNase. The plates were agitated gently for at least 8 hours at room temperature. The filters were rinsed in 
TBST (50 mM Tris HCI, pHS.O. 150 mM NaCI. 0.005% Tween 20). After incubation, the c«!l residues were 
rinsed and incubated in TBS (TBST without Tween) containing 10% sheep serum; incubation was for 1 
hour. The filters were then incubated with pretreated sera in TBS from individuals with NANBH, which * 
included: 3 chimpanzees; 8 patients with chronic NANBH wiiose sera were positive with respect to 
antibodies to HCV C100-3 polypeptide (described in EPO Pub. No. 318.216. and supra.) (also called ClOO); 
8 patients with chronic NANBH whose sera were negative for anti-ClOO antibodies; a convalescent patient 
whose serum was negative for anti-CIOQ antibodies; and 6 patients with community acquired NANBH. 
including one whose sera was strongly positive with respect to anti-ClOO antibodies, and one whose sera 
was marginally positive with respect to anti-ClOO antibodies. The sera, diluted in TBS. was pretreated by 
preabsorption with hSOD. Incubation of the filters with the sera was for at least two hours. After incubation, 
the filters were washed two times for 30 min with TBST. Labeling of expressed proteins to which antibodies 
in the sera bound was accomplished by incubation for 2 hours with *2s labeled sheep anti-human antibody. 
After washing, the filters were washed twice for 30 min with TBST; dried, and autoradiographed. 

A number of clones (see infra.) expressed polypeptides containing HCV epitopes which were im 
munoiogically reactive with serum from individuals with NANBH. Rve of these polypeptides were very 
immunogenic in that antibodies to HCV epitopes in these jjply|>epti^gjvere dete^ in many different 
patient sera The clones encoding these, polypeptides, and the iocattoTS'the fwTv^ In the putative 
HCV polyprotein (wherein the amino acid numbers t)egin' with the putative initiator codon) are the following: 
clone 5-1-1. amino acids 1694-1735; clone C100, amino acids 1569-1 931: . clone 33c, amino acids 1192- 
1457; done CA279a. amino acids 1-84; and clone CA290a amino acids 9-177. The Ideation of the 
Immunogenic polypeptides within the putative HCV polyprotein are shown immediately below. , 
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Clones encoding polypeptides of proven 
reactivity witfi sera from NANBH patients. 




Location within the HCV 
nolvnrotsin 


(amino acid no. beginning with 
putative initiator methionine) 




1-84 






1 Ol 






9-177 




1 1 O I *tD f 


*HJD 




5-1-1 


1694-1735 


81 


1689-1805 


33b 


1916-2021 


25c 


1949-2124 


14c 


2054-2223 


8f 


2200-3325 


33f 


2287-2385 


33g 


2348-2464 


390 


2371-2502 


15e 


2798-2888 


CI 00 


1569-1931 



30 



35 



40 



The results on the immunogenicity of the polypeptides encoded in the variwis clones examined 
suggest efficient detection and immunization systems may include panels of HCV poljpeptides^epitopes. 

Expression of HCV Epitopes in Yeast 

Three different yeast expression vectors which allow the insertion of HCV cDNA nb three different read 
ing frames are constructed. The construction of one of the vectors, pAB24C100-3 is described in B*0 Pub. 
No. 318.216. In the studies below, the HCV cDNA from the clones listed in supra. In the antigenicity 
mapping study using the E. coll expressed products are substituted for the OOO HCV cDNA. The 
construction of the other vectors replaces the adaptor described in the above E-coK studies with one of the 
following adaptors: 

Adaptor 1 

ATT TTG AAT TCC TAA TGA G 
. AC. TTA AGG ATT ACT GAG CT 

50 ' * ^-H>c<^ ^ ^^^^^^ _ . . 

Adaptor 2 

: . GAA TTC TAA TGA G 

55 ^^^^ CTT AAG ATT ACT CAG CT ^ 



45 



The. insjsirte^^ transformed with tfie vectors, using the expression 
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conditions described supra, for the expression of the fusion polypeptide, C100-3. The resulting polypeptides 
are screened using the sera from individuals with (MANBH described supra, for the screening of im- 
nnunogenic polypeptides encoded in HCV cDNAs expressed in£ colL 



Comparison of the Hydrophobic Profiles of HCV Polyprotelns with West Nile Virus Polyprotein and with Dengue 

Virus NS1 



10 The hydrophobictty profile of an HCV polyprotein segnent was compared witii that of a typical 
Flavivirus. West Nile virus. The polypeptide sequence of the West Nile virus polyprotein was deduced from 
the known polynucleotide sequences encoding the non-structural proteins of that wrus. The HCV poly- 
protein sequence was deduced from the sequence of overlapping cDNA clones. The profiles were 
determined using an antigen program which uses a window of 7 amino acid width (tiie amino acid in 

15 question, and 3 residues on each side) to report the average hydrophobicity about a given amino acid 
residue. The parameters giving the reactive hydrophobicity for each amino acid residue are from Kyte and, 
Doolittle (1982). Rg. 19 shows the hydrophobic profiles of the two polyprotelns; the areas eiQrres^nding to 
the non-structural proteins of West Nile virus, nsl through nsS. are indicated in the' figure. As seen in th^' 
figure, there is a general similarity in tiie profiles of the HCV polyprotein and the West Nile virus 

20 polyprotein. 

The sequence of the amino acids encoded in the sVegpon of HCV cONA shown in Fig. 16 has been 
compared with the corresponding region of one of the stm)s of Dengue virus, described supra., with 
respect to the profile of regions of hydrophobicity and hydn)phincity (data not shown). This comparison 
indicated that the polypeptides from HCV and Dengue encoded in this region, which con-esponds to tiie 
25 region encoding NSl (or a portion tiiereof). have a similar hydrophobic/hydrophilic profile. 

The similarity in hydrophobidty profiles, in combination vifth tiie previously identi'fied homologies in the 
amino acid sequences of HCV and Dengue Flavivirus in EP 0218.316 suggests tiiat HCV is related to tiiese 
members of the Ravivirus family. 

30 

Characterization of tfie Putative Polypeptides Encoded Wrthin the HCV ORF 

The sequence of ttie HCV cDNA sense strand, shown in Rg. 17. was deduced from the overlapping 
HCV cONAs in the various clones described in EPO Pub. No.318^6 and those described supra. It may be 

35 deduced from the sequence tfiat the HCV genome contains primarily one long continuous ORR which 
encodes a polyprotein. In the sequence, nucleotide number 1 oon-esponds to the firsft nucleotide of the 
Initiator MET codon; minus numbers indicate Uiat the nucleofdes are that distance away in tfie s'-direction 
(upstream), while positive numbers indicate tiiat the nudeotides are that distance away in the 3 -direction 
(downstream). The composite sequence shows the "sense" stand of the HCV cDNA. 

40 The amino acid sequence of the putative HCVpolyprofceii deduced from the HCV cDNA sense strand 
sequence is also shown In Rg. 17. where position 1 begins wih the putative initiator methionine. 

Possible protein domains of the encoded HCV polyprotein. as well as the approximate boundaries, are 
the following (the polypeptides identified within tiie parentieses are those which are encoded in tiie 
Flavivirus domain): 
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Putative Domain 


Approximate 




Boundary 




(amino acid 




nos.) 


"C" (nucieocapsid protein) 


1-120 


"E" (Virion envelope protein(s) and possibly matrix (M) proteins 


120-400 


"NS1 " (complement fixation antigen?) 


400-680 


"NS2*' (unknown function) 


660-1050 


"NS3" (protease?) 


1050-1640 


"NS4" (unlcnown function) 


1640-2000 


"NS5" (polymerase) 


2000-? end 



'5 . . 

It should bo noted, liowever. that hydrophobictty profiles (described infra), indicate .that HCV diverges 
from the Ravivlrus model, particularly with respect- to' the region upstream* of NSa^Mqreoyer, - the ^ - 
boundaries indicated are not intended to show finm demarcations between the putative polypeptides. 

20 

The Hydrophllic and Antigenic Profile of the Polypeptide 

Profiles of the hydrophilicity/hydroj^obicity and the antigenic index of the putative polyprotein encoded 
25 in the HCV cDNA sequence shown in Fig. 16 were determined by computer analysis. The program for 
hydrophilicity/hydrophobicity was as described supra. The antigenic index results from a computer program 
which relies on the following criteria: 1) surface probability. 2) prediction of alpha-helicity by two different 
methods: 3) prediction of beta-sheet regions by two different methods; 4) prediction of U-turns by two 
different methods: 5) hydrophilicity/hydrophcrtMcity; and flexibility. The traces of the profiles generated by 
30 the computer analyses are shown in Rg. 20. In the hydrophiliclty profile, deflection at>ove the abscissa 
indicates hydrophilicity. and below the abscissa indicates hydrophobicity. The probability that a polypeptide 
region is antigenic is usually considered to increase when there is a deflection upward from the abscissa in 
the hydrophllic and/or antigenic profile. It should be noted, however, that these profiles are not necessarily 
indicators of the strength of the immunogenicity of a polypeptide. 

35 • 

Identiftcation of Co-linear Peptides in HCV and Flaviviruses 

The amino acid sequence of the putative polyprotein encoded in the HCV cDNA sense strand was 
40 compared with the known amino a<»d sequences of several members of Flaviviruses. The comparison 
shows that homology is slight, but due to the regions in which it is found, it is prabably significant. The 
conserved colinear regions are shown in Fig. 21. The amino acid numbers listed below the sequences 
represent the number in the putative HCV polyprotein (See Fig. 17.) 

The spacing of these conserve motifs is similar between the Flaviviruses and HCV, and implies that 
45 there is some similarity between HCV and these flavlviral agents. 

The following listed materials are on deposit under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC). 12301 ParWawn Dr.» Rockvflle, Maryland 20852. and have been assigned 
the following Accession Numbers. v 
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lamDaa-gt 1 1 


ATCC 
NO. 


Deposit Date 


HCVcDNA library 


40394 


1 Dec. 1987 


clone 81 


40388 


17 Nov. 1987 


done 91 


40389 


17 Nov, 1987 


clone 1-2 


40390 


17 Nov. 1987 


clone 5-1-1 


40391 


18 Nov. 1987 


clone 12f 


40514 


10 Nov. 1988 


done 35f 


40511 


10 Nov. 1988 


clone 15e 


40513 


10 Nov. 1988 


done K9-1 


40512 


10 Nov. 1988 


JSC 308 


20879 


5 May 1988 


pS356 


67683 


29 April 1988 



In addition, tie following deposits were made on 11 May 1989. 



Strain 


Linkers 


ATCC 
No. 


D1210(Cf1/5-M) 


EF 


67967 


D1210 (Cf1/81) 


EF 


67968 


D1210 (Cf1/CA74a) 


EF 


67969 


D1210(Cf1/35f) 


AB 


67970 


D1210 (Cf1/279a) 


EF 


67971 


D1210(Cf1/C36) 


CD 


67972 


D1210 (Cf1/13i) 


AB 


67973 


D1210(Cf1/C33b) 


EF 


67974 


D1210(Cf1/CA290a) 


AB 


67975 


HB101 (AB24/C100#3R) 




67976 



35 The following derivatives of strain D1210 were deposited on 3 l^ay 1989. 





Strain Derivative 


ATCC 
No. 


40 


pCF1CS/C8f 


67956 




pCF1AB/C12f 


67952 




pCF1EF/14c 


67949 




pCF1EF/15e 


67954 




pCF1AB/C25c 


67958 


45 


pCF1EF/C33c 


67953 




pCF1EF/C33f 


67050 




pCF1CD/33g 


67951 




pCF1CD/C39c 


67955 




pCF1EF/C40b 


67957 


50 . 


pCF1EF/CA167b 


67959 



The following strains were deposited on May 12, 1989. 
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Strain 


ATCC 




No. 


Lambda gt11(C35) 


40603 


Lambda gt10(b^-5a) 


40602 


D1210(C40b) 


67980 


D1210(M16) 


67981 



10 



The deposited materials mentioned herein are intended for convenience only, and are not required to 
practice the present invention in view of the descriptions herein, and in addition these materials are 
incorporated herein by reference. 



75 



Industrial ^licability 



25 



30 



35 



40 



The invCTtion. in the various manifestations disctosed herein, has many industrial uses, some of which 
are the following. The HCV cDNAs may be used for the design of probes for the detection of HCV nucleic 
acids in sanr^iles. The probes derived from the cDNAs may t>e used to detect HCV nucleic acids in, Jor 
example. -chOTtcaJ synthetic reactions. They may also be used in screening programs for anti-viral agents, 
to determine the effect of the agents in inhibiting viral replication in cell culture systems, and animal model 
systems. The HCV polynucleotide probes are also useful in detecting viral nudeic acids in humans, and 
thus, may serve as a basis for diagnosis of HCV infections in humans. 

In additimi to the above, the cDNAs provided herein provide information and a means for synthesizing 
polypeptides containing epitopes of HCV. These polypeptides are useful in d^ecting antibodies to HCV 
antigens. A series of immunoassays for HCV infection, based on recombinant polypeptides . containing HCV 
epitopes are descril>ed herein, and will find commercial use in diagnosing HCV induced NANBH. in 
screening blood bank donors for HCV-caused infectious hepatitis, and also for detecting contaminated blood 
from infectious blood donors. The viral antigens will also have utility in monitoring the efficacy of anti-viral 
agents in animal model systems. In addition, the polypeptides derived from the HCV cDNAs disclosed 
herein will have utility as vaccines for treatment of HCV infections. 

The polypeptides derived from the HCV cDNAs. besides the above stated uses, are also useful for 
raising anti-HCV antibodies. Thus, they may be used in anti-yCV vaccines. However, the antibodies 
produced as a result of immunization with the HCV polypeptides are also useful in detecting the presence 
of viral antigens in samples. Thus, they may be used to assay the production of HCV polypeptides in 
chemical systems. The anti-HCV antibodies may also be used to monitor the efficacy of anti-viral agents in 
screening programs where these agents are tested in tissue culture systems. They may also be used for 
passive immunotherapy, and to diagnose HCV caused NANBH by allowing the detection of viral antigen{s) 
in both blood donors and recipients. Another important use for anti-HCV antibodi^ is in affinity chromatog- 
raphy for the purification of virus and viral polypeptides. The purified virus and viral polypeptide prepara- 
tions may be us^ in vaccines. However, the piiriiied vims may also be useful for the development of cell 
culture systems in which HCV replicates. 

Antisense polynucleotides may be used as inhibitors of viraf replication. 

For aMweniwrce. the anti-HCV antibodies and HCV polypeptides, whether natural or recombinant, may 
be packaged into kits. 



50 



55 



Claims 

t. A recomUnant polynucleotide comprising a sequence derived from HCVcONA, wherein the HCV 
cDNA is in clone 131. or clone 26j. or clone 59a. or clone 84a, or clone CA156e. or clone 167b. or clone ; 
pi14a, or ctone CA216a, or done CA290a. or clone ag30a. or ctone 205a. or done 18g, or clone r16jh. or 
wherein the HCV cDI^ is of a sequence indicated by nucleotide numbers -319 lb 13^8 o^ to 8866 in 

Fig. 17. ■ . -.4^'^;^^ 

2. A recombinant polynudeotide according to daim 1. encddmgffjB^ 

3. A recombinant vector comprising the polynudeotide of claim; Korjtladr^^ 

4. A host ceil transformed with the vector of claim 3. ; 
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5. A recombinant expression system comprising an open reading frame (ORF) of DMA derived from the 
recombmant polynucleotide of claim 1 or claim 2, wherein the ORF is operabty linked to a control sequence 
compat^te wrth a desired host. 

6. A cell transformed with the recombinant expression system of claim 5. 
5 7. A potyp^tide produced by the cell of claim 6. 

8. A purified polypeptide comprising an epitope encoded within HCV cDNA wherein the HCV cDNA is 
of a sequence indicated by nucleotide numbers -318 to 1348 or 8659 to 8866 in Rg. 17. 

9. An immunogenic polypeptide produced by a ceil transformed with a recombinant expression vector 
comprising an ORF of DNA derived from HCV cDNA, wherein the HCV cDNA is comprised of a sequence 

TO derived from the HCV cDNA sequence in clone CA279a, or clone CA74a, or clone 13i, or clone CA290a, or 
clone 33C or done 40b, or clone 33b, or clone 25c, or clone 14c, or clone 8f, or clone 33f, or clone 33g. or 
clone 39c. or done 15e. and wherein the ORF Is operably linked to a control sequence compatible with a 
desired host 

10. A peptide comprising an HCV epitope, wherein the peptide is of the formula 
15 AAjfAAy, 

wherein x arKi y designate amino acid numbers shown in Rg. 17, and wherein the peptide is selected from 
the groi^ conasting of AA1-AA25, AA1-AA50. AA1-AA84. AA9-AA177, AAl-AAIQ, AAS^VtAgO, AA2^-AA25^Jv 
AA35-AA45, AA50-AA100. AA40-AA90. AA45-AA65, AA65-AA75. AA80-90. AA99-AA120, AAte-AAIIO. 
AA105-AA120. AA100-AA150, AA150-AA200, AA155-AA170. AA190-AA210, AA200-AA250. AA220-AA240. 

20 AA245-AA265. AA250-AA300, AA2g0-AA330, AA290-305. AA300-AA350. AA31G-AA330, AA350-AA400. 
AA380-A^^. AA405-AA495. AA400-AA450. AA405-AA415, AA415-AA425, AA425-AA435. AA437-AA582. 
AA450-AA5CKJ. AA440-AA460. AA460-AA470, AA475-AA495, AA500-AA550. AA511-AA690. AA515-AA550. 
AA55O-AAe00, AA550-AA625. AA575-AA605. AA585-AAS00. AA600"AA65a AA600-AA625, AA635-AA665. 
AA650-AA700. AA645-AA6aO, AA700-AA750. AA700-AA725. AA700-AA75a AA725-AA775, AA770-AA790. 

25 AA750-AA800. AA800-AA815, AA825-AA850. AA850-AA875. AA800-AA850. AA920-AA990. AA850-AA900, 
AA920-A^45, AA940-AA965, AA970-AA990, AA950-AA100D. AA1000-AA1060. AA100C>-AA1025. AA1000- 
AA1050. AA1025-AA1040. AA1 040-AA1 055. AA1075-AA1175, AA1050-AA1200. AA107CKAA1100. AA1100- 
AA1130. AA1140-AA1165. AAI192-AA1457, AA1195-AA1250, AA1200-AA1225. AA1225-AA1250, AA1250- 
AA1300. AA1260-AA1310, AA1260-AA128a AA1266-AA142a AA1300-AA1350. AA1290-AA1310. AA1310- 

30 AA1340. AA1345-AA1405, AA1 345-AA1 365, AA1350-AA1400. AA1365-AA1380. AA1380-AA1405, AA1400- 
AA1450. AA1450-AA1500. AA1 4eO-AA1 475. AA1475-AA1515, AA1475-AA1500. AA1500-AA15S0, AA1500- 
AA1515. AA1515-AA1550, AA1550-AA1600. AA1545-AA1560. AA1569-AA1931, AA1570-AA1590. AA1595- 
AA1610. AA1590-AA1650, AA1610-AAie45. AA1650-AA1690. AA1685-AA1770. AA1689-AA1805, AA1690- 
AA1720. AA1694-AA1735. AA1 720-AA1 745. AA1745-AA1770, AA1750-AA1800. AA1775-AAiaiO. AA1795" 

35 AA1850, AA1850-AA1900, AA1900-AA1950. AA190D-AA1920.,AA19ie-AA2021. AA192D-AA1940! A/Cl949- 
AA2124. AA1950-AA2000. AA1 950-AA1 985, AA1980-AA2000. AA2000-AA2050. AA2005-AA2025. AA2020- 
AA2045. AA2045-AA2100, AA2045-AA2070. AA2054-AA2223. AA2070-AA2100, AA210a-AA2150. AA2150- 
AA2200. AA2200-AA2250. AA2200-AA2325. AA2250-AA2330. AA2255-AA2270. AA2265-AA2280, AA2280- 
AA2290. AA2287-AA2385. AA2300-AA2350. AA2290-AA2310, AA2310-AA2330, AA2330-AA2350. AA2350- 

40 AA2400. AA2348-AA2464. AA2345-AA2415, AA2345-AA2375, AA2370-AA2410. AA2371-AA2502. AA2400- 
AA2450. AA2400-AA2425, AA241 5- AA2450. AA2445-AA2500. AA2445-AA2475. AA2470-AA2490, AA2500- 
AA2550. AA2S05-AA2540, AA2535-AA2560, AA2550-AA2800. AA2560-AA2580. AA2600-AA2650, AA2605- 
AA2620. AA262G-AA2650, AA2640-AA2660, AA2650-AA2700, AA2655-AA2670. AA2670-AA2700, AA2700- 
AA2750. AA2740-AA276G. AA2750-AA2800. AA2755-AA2780, AA2780-AA2830, AA27B5-AA2ai0. AA279e- 

45 AA2886. AA2810-AA2825, AA2800-AA2850, AA2850-AA2900. AA2850-AA2865. AA2885-AA2905. AA2900- 
AA2950. AA2910-AA2930. AA2925-AA2950. AA2945-end(C' tenninal), 

1 1 . A polypeptide comprised of the peptide of claim 10. 

12. An Immunogenic polypeptide attached to a solid substrate, wherein the polypeptide is according to 
claim 7. or ctedm 8, or claim 9. or claim 10. or claim 11, or wherein the polypeptide is comprised of an 

50 epitope encoded within HCV cDNA wherein the HCV cDNA is of a^sequence indicated by nucleotide 
numbers -319 to 1348 or 8659 to 8866 in Fig. 17. 

13. A monoclonal antibody directed against an epitope encoded in HCV cDNA, wherein the HCV cDNA 
is of a sequence Indicated by nucleotide numbers. -319 to 1348 or 8659 to 8868 in Fig. 17; or is the 
sequence present In clone 13i, or clone 26j, or clone 59a. or clorie a4a. or clone CA156e. or clone 167b. or 

55 clone f^14a. or clone CA2l6a, or clone CA290a, or clone ag3da; or clone 205a, or clone'lSg, or clone 16jh. V 

14. A jMeparation of purified polyclonal ^"^^^ffl^jl^f ^ d ^ of an 
epitope encvKied within HCV cDNA. wherein ftle'rn^^ by nucleotide 
numbers -319 to 1348 or 8669 to 8866 in Rg. 17, or Is tt^^^^^ or clone 26], or 
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clone 59a. or done 84a, or clone CA156e. or clone 167b. or clone pi14a, or done CA216aor clone 
CA290a. or clone ag30a. or done 205a, or clone 18g, or clone 16jh. 

15. A polynucleotide probe for HCV, wherein the probe Is comprised of an HCV sequence derived from 
an HCV cDlviA sequence indicated by nucleotide numbers -319 to 1348 or 8859 to 8866 in Fig. 17. of^rom 

5 the complement of the HCV cDNA sequence. 

16. A kit for analyzing samples for the presence of polynucleotides from HCV comprising a poly- 
nucleotide probe containing a nucleotide sequence of about 8 or more nucleotides, wherein the nucleotide 
sequence is derived from HCV cDNA which Is of a sequence indicated by nucleotide numbers -319 to 1348 
or 8659 to 8866 In Fig. 17. wherein the polynucleotide probe Is in a suitable container. 

10 17. A kit for analyzing samples for the presence of an HCV antigen comprising an antibody which 
reacts immunologically with an HCV antigen, wherein the antigen contains an epitope encoded vrithin HCV 
cDNA which is Of a sequence indicated by nucleotide numbers -319 to 1348 or 8659 to 8866 in Rg. 17, or 
wherein the HCV cDNA is in clone t3i, or clone 26j, or clone 59a, or clone 84a, or clone CA156e, or clone 
167b, or clone pi14a, or clone CA2l6a. or clone CA2g0a. or clone ag30a, or clone 205a. or clone 18g. or 

IS clone 16jh. 

18. A kit for analyzing samples for the presence of an HCV antibody cdmprisingjan, antigenic 
polypeptide containing an HCV epitope encoded within HCV cDNA which is of a sequence ''jhidicated iDy ' 
nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 17. or is in done 131. or clone 26j, or clone 59a. pr 
clone a4a, or clone €A156e, or done 167b. or clone pi14a, or clone CA216a, or clone CA290a. or clone 

20 ag30a. or clone 205a. or clone 18g. or clone 16jh. * 

19. A kit for analyzing samples for the presence of an HCV antibody comprising an antigenic 
polypeptide expressed from HCV cDNA in done CA279a, or clone CA74a. or done 13i, or clone CA200a, or 
done 33C or clone 40b. or clone 33b, or clone 25c. or clone 14c. or clone 8f, or done 33f. or clone 33g, or 
clone 39c, or clone 15e, wherein the antigenic polypeptide is present in a suitable container. 

25 20. A method for detecting HCV nucleic acids in a sample comprising: ; 

(a) reacting nucleic acids of the sample with a polynucleotide probe for HCV, wherein tfie probe is 
comprised of an HCV sequence derived from an HCV cDNA sequence is of a sequence indicated by 
nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 17. and wherein the reacting is under conditions 
which allow the formation of a polynucleotide duplex between tfie probe and the HCV nucleic acid from the 

30 sample. 

(b) detecting a polynucleotide duplex which contains the probe, formed in step (a). 
21 , An immunoassay for detecting an HCV antigen comprising: 

(a) incubating a sample suspected of containing an HCV antigen with an antibody directed against an HCV 
epitope encoded in HCV cDNA. wherein tiie HCV cDNA is of a sequence indicated by nucleotide numbers 

35 -319 to 1348 or 8659 to 8866 in Rg. 17. or is tiie sequence present in done 13i, or clone 26j, or clone 59a. 
or clone 843. or clone CA156e, or clone 167b, or clone pi14a, or clone CA216a. or done CA290a, or clone 
ag30a, or clone 205a. or clone 18g, or clone 16jh. and wherein tiie incubating is under conditions which 
allow formation of an antigen-antibody complex: and (b) detecting an antibody-antigen complex formed in 
step (a) which contains the antibody. 

40 22. An immunoassay for detecting antibodies directed against an HCV antigen comprising: 

(a) incubating a sample suspected of containing anti-HCV antit>odies vnth an antigen polypeptide 
containing an epitope encoded in HCV cDNA. wherein the HCV cDNA is of a sequence indicated by 
nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 17, or is ttie sequence present in clone 13i, or 
clone 26j, or clone 59a. or clone 84a, or clone CA156e. or clone 167b, or clone pi14a, or clone CA2l6a, or 

45 clone CA290a. or clone ag30a. or done 205a. or done 18g, or done t6jh, and wherein the incubating is 
under conditions which allow formation of an antigen-antibody complex; and 

(b) detecting an antibody-antigen complex formed in step (a) virhich contairis tiie antigen polypeptide. 

23. An immunoassay for detecting antibodies direct^ ag^nst'an HCV antigen comprising: 

(a) incubating a sample suspected of containing ariti-HCV antibodies with the polypeptide of clairn 9, , 
50 under conditions which allow formation of an antigeri^antibpdy complex: and . 

(b) detecting an antibody-antigen complex formed in step (a) which coritains tiie antigen polypeptide. 

24. A vaccine for ti-eatment of HCV infection comprising an immunogenic polypeptide containing an 
HCV epitope encoded in HCV cONA. wherein tfie HCV cDNA is Of a sequence indicated by nucfeotide 
numbers -319 to 1348 or 8659 to 8866 in Fig. 17 or is the sequence present in done 13i. or clone 26j, or 

55 clone 59a. or done 84a. or clone CAISOe*; or clone 167b. or done pi14a, or clone CA216a, or clone 
CA290a. or clone m3Qa^iiOf(ijc\oj^^L^^ ^ t6ih. and wherein tiie immunogenic 

polypeptide is present in a pharml^I^^p^ phanrnaceuticatty acceptable exqpient. 

25. A metiiod for producing antib^ to ah individual an isolated 
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Immunogenic polypeptide containing an HCV epitope encoded in HCV cDNA, wherein tiie HCV cDNA is of 
a sequence indicated by nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 17. or is of the sequence 
present in clone CA279a, or clone CA74a, or done 13i, or clone CA290a, or clone 33C or clone 40b, or 
clone 33b, or clone 25c. or clone 14c. or clone 8f. or clone 33f. or clone 33g, or clone 39c, or clone 15e. 
and wherein the Immunogenic polypeptide is present in a phanmacologtcally effective dose in a pharmaceu- 
tically acceptable excipient. 

26. An antisense polynucleotide derived from HCV cONA, wherein the HCV cDNA is that shown in Fig. 

17. 

27. A method for preparing purified fusion polypeptide CI 00-3 comprising: 

(a) providing a crude cell iysate containing polypeptide CI 00-3. 

(b) treating the crude cell Iysate with an amount of acetone which causes the polypeptide to 
precipitate. 

(c) isolating and solubilizing the precipitated material, 

(d) isolating the C100-3 polypeptide by anion exchange chromatography, and 

(e) further isolating the C100-3 polypeptide of step (d) by gel filtration. 

28. A method for preparing an HCV polypeptide comjOTSing: t 

(a) providing a host cell transformed with a recombinant expression system comprising' hn open 
reading frame (ORF) of DNA derived from HCV cDNA, wherein the HCV cDNA is in clone 13i. or clone 26], 
or dona 59a. or clone 84a. or done CA156e. or clone I^Tb, or clone pi 14a, or clone CA216a. or clone 
CA290a. or clone ag30a. or clone 205a; or clone 18g, or clone 16jh. or wherein the HC\{ cDNA is of a 
sequence indicated by nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 17, wherein tiio ORF is 
operably linked to a control sequence compatible with a desired host; and 

(b) incubating the host cell under conditions with allow expression of the HCV polypeptide. 

29. A metiiod for preparing an immunogenic HCV polypeptide comprising: 

(a) pre>viding a host cell transformed witii a recombinant expression vector comprising an ORF of 
DNA derived from HCV cDNA. wherein the HCV cONA is comprised of a sequence derived from the HCV 
cDNA sequence in done CA279a. or clone CA74a. or clone 13i. or done CA2g0a, or clone 33c, or clone 
40b. or clone 33b. or clone 25c. or clone 14c. or done 8f, or done 33f. or clone 33g. or done 39c. or clone 
15e. wherein the OEF is operably linked to a control sequence compatible with the desired host; and 

(b) incubating the host cell under conditions which allow expression of the HCV polypeptide. 

30. A method for preparing a host cell transformed witii a recombinant polynucleotide comprising a 
sequence of HCV cDNA derived from the HCV cDNA in done 13i. or done 26j. or clone 59a, or done 84a. 
or clone CA156e. or clone 167b, or clone pi14a. or clone CA216a. or clone CA29Qa. or clone agSOa, or 
clone 205a. or done 18g, or done 16jh. or wherein the HCV cDNA is of a sequence indicated by nucleotide 
numbers -319 to 1348 or 8659 to 8866 in Rg. 17 comprising: 

(a) providing a host cell capable of transformation; 

(b) providing the recombinant polynucleotide; and 

(c) incubating (a) with (b) under conditions which allow transfonmation of the host cell witii the 
polynucleotide. 

31. A method for preparing a recombinant polynucleotide comprised of a sequenca of HCV cDNA 
derived from the HCV cDNA in clone 13i, or done 26j. or done 59a, or clone 84a, or done CA156e. or 
clone 167b. or clone pi14a. or done CA216a» or done CA290a. or clone ag30a, or done 205a, or done 18g, 
or clone 16jh. or wherein the HCV cDNA is of a sequence indicated by nucleotide numbers *319 to 1348 or 
8659 to 8866 in Fig. 17 comprising: 

(a) providing a host cell transformed vnth the recombinant polynucleotide; and 

(b) isolating said polynucleotide from said host cell. 

32. A method for preparing blood free of HCV comprising: 

(a) providing a sample of blood suspected of confining HCV and anti-HCV antibodies; 

(b) providing an Immunogenic polypeptide prepared according to claim 28 or 29; 

(c) incubating the sample bf^(^ with tiie immunogenic polypeptide of (b) under conditions which allow 
the formation of antibody-HCV polypeptide complexes; 

(d) detecting the complexes formed in step (c); and ; 

(e) saving the blood .fronri which complexes were not deteded in (d). 

33. A method foi' preparing blood ^f^^ 

(a) providing nucleic acids from a sample of blood suspected of containing HCV polynucleotides; 
. V is comprised of an HCV sequence derived from an 

HCV cONA w^^^ by nucleotide numbers -319 to 1348 or 8659 to 8866 in Fig. 
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(c) reacting (a) with (b) under conditions which allow the formation of a polynucleotide duplex 
between the probe and the HCV nudeic add from the sample; 

(d) detecting a polynudecrfide which contains the probe, fonnned in step (c); and 

(e) saving the blood from which complexes were not detected in (d). 

5 34. A method for produdng a hybridoma which produces anti-HCV monoclonal antibodies comprising: 

(a) immunizing an individual with an immunogenic polypeptide containing an epitope encoded in HCV 
cDNA, wherein the HCV cDIMA is HCV cDNA in done 13i, or clone 26i, or clone S9a, or clone 84a. or clone 
CA156e, or clone 167b. or clone pi14a, or done CA216a. or done CA290a. or done agSOa. or clone 205a. 
or clone 18g, or clone 16jh. or wh^n the HCV cDNA is of a sequence indicated by nucleotide numbers 

10 -319 to 1348 or 8659 to 8866 In Rg. 17; or 

(b) Immunizing an indivldial wflh an immunogenic polypeptide prepared according to claim 29; 

(c) immortalizing antibody produdng cells from the immunized individual; 

(d) selecting an immortal cell which produces antibodies which react with an HCV epitope in the 
immunogenic polypeptide of (a) or (b); and 

75 (e) growing said immortal celL 



zo 
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Translation of wui' I2f 

omT»Mim«siccac«rqc«oociaaK«^^ 
ig»ceiaTOeccccTTajjma!«GM e we»6icu:ii^^ 




ACCCOIGTTCGTAGCGCAGGACCCGCTAATTCRCXCICXTGCAGCAAGAGG^^ 



■»ni - -v-i5Ii^if^2i^'''^9ValCfsSetCysr*uTrpMetMetL«il*uIleS^^ 



-Overlip with 141- 



TCaCCGAAACCTCTTQSACamxcaUlTTACGTCCTAGGGACCGGC^ 
val 

421 TTGTATC 
AACATAC 



FIGORE 1 
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Translation o£ OKA M'l 



CGa»TMTC*m08SrtGCCrCCGCCGGGCCTGGTCGC»»aWICACGACCGTGATGG 

«s<ieiiiiaBucQXkTMa(»«icscTTCtaic^^ 

CACTtmCTATCCCtCaWAAGCACCAATTGTTATGGTCCGCXGGajS^ 

301 TCGGTTGXACnCGAXCAACTCAACTGGAXtaurCAMGTGTaGSAGC^ 

AGCCAACATOacCtACTTGAGTTGACCTAAGTGGTTTOKa^KSSwA^ • 

agtacccicccogcocsttcttstsggacgtsacggwtSSS^^ 

421 cGSssi;s§gsss?i2gg?giraj^^^ 

CCCIGCCGWtJaG*<»6CCACGCCSJU««:CA0G«^ 

j^ASSJi-^--^"^^^ 

- tGAtCGGCAaTCCGfcAACCGTAATAGGAACAtGSTAGTTGATGTGATATAAATTTTAGT 



JJtJ^^yayValGluHlaArglAuGluAlaAlaCytAanTrpThpArgGlyGlu 




ArTCysAspLcuGluAspArgAaipArgSsxClulAuSerPreUuLeuLeuThrThrThr 

601 AACGIIGCGMCTGGAAGAT^ 

tTGCAAOBCIACACCrrCTATCCCTGTCCAGSCTCGACTCGCGCAAtCACGAC^TGAT 

GTGTCA0CSICCAIWAG66CACAASGAAGTGTTGGGACGGICGCAACACGTGCCCGGAGT ^ 
""rr"?*^}??,***** Combined ORF of DNAs x:f through ISe — 

AGGlXWAGGTGGTCTTGTAACACCTGCACGlCATCMCAtGCCCaCCCCAGTTCGTACC 

— * ,■*■■■* 
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5SISJS5?SS?5^"®™SATGATGCt*CTCATATCCCAAScaGwSG^ 
C5«<»CSA<WACCAACACCTACtAC0ATSJW»Xt»aWT^CC^^ 



TG«a«:CAITAT(»ATTAC6XCGTAG«»CaWCCCIGCe^^ 



ICS 



961 TG?^SgSS?Si;SSSgfe^Jg3Sg?SJ|^^ 



«cci5iccc:icc»cc«a:Bei»K«»cco:cACAjo»s»«a6cc^ 



»«»C«!I«m»*ISTIC0C<aiAT«5TC0JKaiCSMaiCCMC^^ 



»ciG«Tne»ccTTCscoiMACsi«3vc»aa«jua»5TOMra 



CCCTCCOACAGTAGAAtGAGTACACACSACATWMCtSACACCATAAACTGTACTeOT 



1321 AATTGCTGCtGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAG 
TTAACGACGACCCGCAGAAGCCTGGGGAAACCtAAGAAGTTCCGtC 



FIGURE 2 -2 
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Translation of oia I5e 
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» ■ 



Translation of ONA 131 
AGTAGCCTCCCCGCCCGmTTGTGOSACGTGACGGGGTGACTAACGAwSOTTcS^ 



61 



121 



301 



361 



421 



481 



•Ovwrlap with 12f— — — 



ki???S??/i?l^5f!f**P*'9**P*'9SerGluLeuSerProLeuL«uLeuTj»«^ 



6«TC*CCSTeCAOaiOG<K»ai»OGMSTSTISsS^ 



FIGURE 4 
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Translation of DMA 26j 



1 G^?S?S?2i?ggSgS^J?g^^ 

cgaawsatagtggxgttcaagxtcawuvgtccgJcSaEJc?^^ 



TG(5«aUTWCTAAAACTGGTCCCSACCCCGSGATMTcST^^ 



121 cSlS§?§!?5?gSg;?gS2KsaK??^^^ 

GCTGGTCGCGGGCAT6AC(»CCGTCAJGGGGGctJ?S^ 
CTCACACACACCAGOCCATATAACGAiMSTGAGGGTCGGGGCACCACCACCC 



^vt^^^^■»-,Ji■^'■...^3^-^,.^.^ 



FIGURES 
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Translation of DMA CA59a 

AACC»mCCGAGTCGACGAGGCCrA<WGTGtTC(WIJUaUMOTGTAC^^ 
"isTrpGlyValLeuAlaGlylleAlaTyrPheSerMetValGlvAsnTroAlaLwval 

61 c^^tggsgagtcctggccggcatagcgtattcctccatggtSg^ 

GTGACCCCTCAGGACCGCCCGTATCGCATAAAGACGTACCACOcSScC 

, i^!JiiJ2j}f*ii***^^"PheAlaGlyValAspAlaGluThrltt8Valihr61ySl»S« 
121 ctcgtactgctgctgctatitcccggcgtcgacgcggaaaccSostcacSgg^ 
gaccat(»cgacgacgataaacgcccgcagctgcccctttg^tSagtScccS 

, ^*'5^y"^»^'»^v»lSerGlyPheValSerLeuLeuAlaPrciGlyAlaLvsGliiAai^al 

GTCGACTAGTTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTBCTA 

, ?*5i:?ii*!'»^^^5^^g^'J*l»GlyLeuPheTyrHisHisLyaPheA«nSerS«Gl7 
i5£CTCAACACCGGClXMTTGGCAGGGCTTTTCXATCACCACAAGTTCAACICra 

tcggagttgtggccgaccaaccgtccccaaaagatagtggtcttcaagttgS^ 

- - Overlap with 26j— : 

—Overlap with K9-1 

CysProGluArgLeuAlaSerCysArctPro 
361 TGTCCTGAGAGGCTAGCCAGCTGCCGACCCC 
ACAGGACTGTGC6ATCGGTCGACGGCTGGCG 



FIGURE 6 
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Translattcm of ONA CA84a 



61 ATAJSSSil?Sll?^SJ2«2?*}£Al?^ 



121 



181 



241 




CCGACGAGGCCTAGG 

aiGT,CaXAGAACCISTAcJ§g2g?|Igii^ 
piri*"~ — Overlap with CA59a-— 

TCGACGC(BAAACCCACGTCAcSgG6 
AGCTGCGCTXTGGGTGCAGTGGCOC 






FIGURE 7 
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Translation of ONA CA156e 




CCCACCGCTACTGGGGATGCCACCWTGGTC^^^ 



•IGGAASASAGGGICCGC 



LeuArglleProGlnAla 
GCTCCGGATCCCACAAGCC . 
CGACGCCTAGGGTGTTCGG ■ ' 



FIGURE 8 
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Translation of DKA CAl$7b 



*"r V7~~"~~0verlap with CAlSSc— — 



«^*i5iy**Pi«uCy«ClySerValPheLeu 
GCACCCCCTGAACACGCCCACACAGAAAGAAC 



FIGUM9. 
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^•aaUtldtt of SNA CUlCa 

301 VSaKThnMCeeceeaaGCCMieeveeArMf^i^^ 



MaCMQCTTOQCi 



^_'!cg«?CCTccAc>CTccooqqTO 

CGSCX*OBSI»OQ»C6X«8J««CCCCK0C*«iaAAC^^ 



tooccCTSccs>«ieeyMfrc>co » ajuasc<MMC(g>SS 



FIGURE 10 
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Ifeuwlatim e£ XMk cUflOa 

CtetCTMCMCOICCICmtG*^^ 

scxocscxrxeMMKxncsccMra^ 



CS<MOS«KX66666C06CAS0ttaCCO6TZAM^aSKCACTAaeSM^UM^ 



cqc eqM fles6CTSflMi!!MCccAWiMoo o QA o cj>acogcqoqGAaMecwe^^ 



-Orwrlap with cxaifa- 



qqieeccG6»ccce8«ccBCAccccciu>aAccTiciocega>CTi<uti>e cro iie o^ 



481 AMCTCcSSScTfflSSSSSe 
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te&AsXa^ion of DMA «930« 



MetSerVaXVilGtaProPi^GtyProPrQili«u 



MetAlaLeuv«iOP 
OCGTCriTCGCMATCaeXACCOC^ 

JWK^CCCTCTOGOTATCACCAGACGCCTTIKXr^ 



f^tetrroGlyAspLeuGLTVUProrrosloAjp 

121 OWirCTTTCraGfflTCAACCCCCTCAArfiCC^^ 

GCCC^CCAAAaAACCXAGTTGGCCGACTaCGGACCTCT^ 

CfTM OP AM GljAlaCr* 

191 CTGCTAGCCGAGTAGTOTTGGGTCGCGAAI^SGCCTTC»OTACTCCC^ 
GACGATCGGCTCATaCAACCCACCGCTTCCGGAW:RC(^ 

GluCya?roGlyArgSerArgArgPro(^sThit!itSerThrA3nrro^ 

24 1 GCGAGXCCCCCGGGAGGTCTCGTAGACCGTGCACCAlGAGaUXiAATC 

CGCTCACGGGGCCCTCCAGACCATCTCaaCCTCCTACTCGTGCraACGATTKGJ^ 

LyaAonLyaArgAanThrAanArgAr^roGlnAapVAlLyaPheProOlyGlTGlyGIn 



301 AAAAAAACAAACGTAACACCAACCGTCGCrCACAGGACGTCAMTTa 
CTTTTrTCTTTCCATTClWTTGCCACC<WTCT<X7CCA(:^^ 

ZlvValCIyClyValTyrlAuLeuPrckrgAr^QlyPrQAr^teuGIyVmlArgAlatYir 

361 AGATOGTroCTCGACrrXACTTCTTCCCGCGCAGGGCCCCT^ 

rCTACCAACCAC C TCAAATCAACAACCCC;CCXCCCCCgCATCTAACCCA 

ArgLyaThrSerGluAjr^SerGlnPrd^rgGlyArtrAr^QlnProZlcProLyaM^^ 



421 DSAGAAAGACmCGAGCGGTCGCAACCICGAGGtAaAanCAaCCTATCOCCAA^ 
CCTCrarCTGAA^CrCGCCACCGTXGCJiaCTCCATCTGC^ 

ArgProGluOl'yArgThrTrpAIaGlifProGlyTyrProTrpPM&tUTysGIyM^ 



• Overlap with CA290»* 

4 ii GTCOKrcrGAGGGCAGGACCTOGGCTCACCCCGGGTACCCTTG^ 

CAGCCGGCCTCCCGTCCTGGACCCCAarCSGGCCCATCGGAACCGGCGACATA 
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eirCjaairrrpAl«Qiytrff«ui^oriTOArfGlyserte 



TO 



iacas«scccxcccGcc»Aca^^ 



601 CCACAOhCCC(XXXCGTKQGtaXSC^^ 

(SCTGtCMCC<»CCCCATCCAGCGCGTtAAACCCATTCCAGTAGCTATC 



Phft 



Wl GCTTC 
OGAAS 



« • Start of long HOT OP 

I - Putative firat aidLno said of large HCV polyprotein 
i - Pucative snail encoded peptidase that may plfty a 
trmAAlatienJil ra9uLatery rolo} 
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1 



121 



Xtuslttioa of xnoi CA305« 



CgrCTroGgtOgOSAIW3QCCTT QW t/IM 



tenthrAnAx9lUrgProOlBA«pV«lLy«Fbtfroai7alyai3^ 
MCACCJUUXSXCGCOCACAK 
TTGTGGnCCXJ^GttfeTCCSCauSTTCMGC^ 

valtyrLeuLeuPrcAraAroGlyrrougXitueir^^ 
l&l GTrUCmTIGCCGOGUOGGGC^^ 

ciJukTCuu^cAjycaaoQcaseccoMOMCTA^ 



.i.*...^*— «OvwlAp with CA290ft— ' — 

GiuAr9Ser6infroiorMiTArqAZfGinfrou«prGt»ilua^ 

241 6UCGGTGCCJU^CTC8Jlce»CA0C<rc^ 

CTCGCCMCOrtQ(3JW3CTCCATCTQCA6TCaQA 

ArgThnrpioaGlnrraSlrrnPnRpFraM 

301 JU»ACCK6GCTCMCC0GGGTACCCfft6CCCCCtCtA9^^ 
TCC«OQAO0OaMtO900COCAra«AACC00(^^ 

« - 9utatlv0 Initiator aethlotiiAe eoaoa 
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IProProOP — — — . 



*M«tS«rValV*lcinPtcP«)Cly»toP«stBuRro6lyCluProMt 
M&tAlaleuVdlCP 



121 GGTCTCCGGAACCGGTGAGTACACCGGJUkTlXK^^ 

CCAGACGCCCTGGCaCTCATGTCGCCTaACcSraeCT^ 



-Ov«rlsp with «gaOa- 



tKetPrdGlyjUpLeuGiyviftiPrtfXQeiiiAspCysm 

TlXWCCCACTTACCCACCTCTAAACCCCC 



OP AM CIfAXaCy0GIuCyaProGlyArgS«r 

241 TCGOTCGCGAAASGCCrrGTC»TACTCCCT<aia CG Sl g C m ^ 
ACCCAGCSCTTTCCG(aJUJlOCATGACGGACtt 



ArgArg 



301 CTCGTAGft 
CACCATCT 



♦ - Start of lonj HOT ORP 

» - Putatiiva small encoded p9txdes(tl*at nay pliy 
& translatlonal regalatocy rol^}« 



FIGOBE 14 
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61 



121 



181 



Translation of dna 16jh 
Overlap with I5e , 

CCCCGGACBATOAGGTATC^TGGmCCTAGATGGiwS^ 

6AGXCGCGXAMAGIGa(»TGXaVATGA<»GSccSM^^ 

Gly* 

cgatccgaagaccggtctcctccgtcccgacggtatSaSgttS 



AlaValArgThrLysLeuLys 
241 GCAGTAAGAACAAAGCTCAAAC 
CGTCATTCXTGmCXSAGTTTG 



- nucleotide heterogeneity 



FIGURE 15 
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5ggtMED ORF OF DMAs pil4a THROUGH ISe 

{ Pil4*/CM67b/CMS6«/CA84«/CAS9a/K9-l/12f/l41/llb/7f/7e/ 

ah/33c/40b/37b/35/36/81/32/33b/2Sc/l4c/af/33f/33g/39</ 
3S£/19g/2fig fi 15a) 

GAGAieCACCCCCTGSATACGCCOU^CAGAAAGAACaGCCGGMGiS^^ 



361 



CGWXCGACGAGGCCaGGGTCTTCGGTAGAACCTGTACIAGCSACCACGAGTfflCMCT 

57^fSSS5S2"*^^'*"^'^*=*«!GTCGGGAACTGGGCGAAGGTC^^ 
CAGGACCGCCCGnTOXaiAAJiUSAGGTACCACCCCTTGACCCSC^^ 

" CTCCTGCTAmeCCGGCGTGGAMCGGAAACCCACGTCACOMS^^ 
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TOJW»CMACCTAAACAATCGGAG<»UKOT(KTCaKGCtX^^ 

«n, *f2?JS^!lSiX^*'*^''i*^««A«'»S«rThriU«LeuJUinCyaA«nAspSerL«uAsn 
901 AACACCAACGGCAGTT<MCACCTCAATJWK»C(K(XCI^ 

TTGTGGTTKCGTCAACCGTSCAGTtATCCTGCCOWJWITTGACCTTACTA 

TCCGATCGGTCSACGGCTGGGGAATGGCTAAAACTGC^^ 

mo T 5i*^?2Siy!;Siy£"A«P<51nArgProTyrCysTrpHisTyrProProL^^ 
loai GCCAACGGAAGCGGCCCCGACCA)GCGCCCCtACTCCTG<»w:TACCa^ 

CGGTTGCCXTCGCCGGGGCTGGTCGCGGGGAMAOSACCGXGATGGGGGGTTTTG^ 

.... 5iyli«valProAlaLyaSerValCysGXyProValTyrCysPheThrProS«ProVa 
1141 GGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGlATACTGCTTCACtCCCAGCCa^ 
CCAXAACACGGGCGCXTCTCACACACACaGGCCAtAtAACGAASTCAGGGTC^ 

, , Y*^'^*^^^y^^*"^'^P^9SerGlyAlaProThrryrs«rTrpGIyGluA5ciA«p ' 
1201 GTGGTGGGAACGACCGACAGGTCGGGCGCGCCCAKTACAGCT^ 

CACCACCCXTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCCCACTXXTACTATGC 

AapValPheValI^uA»nAsnThrArgProProLeuGlyAsnTrpPheGlyCysThrTrp 
1261 GACGTCTTCGTCCTTAACAATACCAGGCaCCGCTGGGCAATKGTTCGGTTGtACCT^ 
CTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCXSACCCGWAACCAAGCCAACATGG^^^ 

MetAsnSerthrClyPhcThrLysValCysGlyAlaProProCyaVallleGlyGlyAla 
1321 ATGAACTCA^CTCGATTCACCAAAGTGTGCGGAGCGCCTCCTTCXGTCATCGGAGGGGCG 
TACTTCAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCT^^ 

, , ^ . GlyAsnAsnThrLeuHisCysProthrAspCysPheArgLysHlsProAspAlaThrTyr 
1381 GGCAACAACACCCTGCACTGCCCCACKiATTGCTTCCGCAAGCATCCGGACGC^ 
CCGTTGTTGTGGGACGTCACGGGGTCACTWVCGAAGGCGTTCGXWGC^ 

SerArgCysGlySerGlyProTrprieThrProArgCysLftuValAspTyrProTyrArg 
1441 TCTCGGTGCGGCTCCGGTCCCXGGATCACACCCAGGTGCCTGGTCGACTACCa 

AGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGXCCACGGACafiCTGATGGGCATATCC 

1 S 0 1 CtXXGGCAXXAXCCITGXACCAXCAACXAaCCAXAXXIAAAAXCAGGATGXACGXGGGA 
GAAACCGXAAXAGGAACATGGXAGXXGATGTGGIAXAAAXTTXAGXCCXACATGCACCCX 

GlyValGluHlsArgLeuGXuAlaAlaCysAsnXrpXhrArgGlyGluArgCysAspLeu 
1561 GGGGXCGAACACAGGCXGGAAGCXGCCXGCAACTGGACGCGGGGC6AAC6XXGCGAXCTG 
CCCCAGCrXGXGXCCGACCXXCGACGGACGTIGACCXGCGCCCOGCXXGCAACGCXAGAC 

GIuAspArgAspArgSerGluLauSerProUulAUl^uThrXhrXhzGlnXrpGlnVal 
1621 GAAGACAGGGACAGGXCCGAGCtCAGCCCGTTACTGCTGACCACXACXCAGXGGCAGGTC 
CXXCXGXCCCXGXCCAGGCXCGA6Xa»;CAAXGACGACXGGXCAX6X6XCACCGXCCAG 

^^^^'"'^^ 

1681 v.. CXCCCGTGXTCCTTCAaUVCCCIACCaGCCTXGTCacaSGCCICATCCACCXCa^ 

GAGGGCACAAGGAAGTGXTGGGAXGGXa»;MCAGGXGGCCGGAGXAGGTGGAGGXGGrc 

AshlleValAspValGlnXyrLeuXyzGl^r^alGlySerSerlleAlaSerXrpAlalU 
1741 ■ AACATXGIGGACGXGCAGXACXXGXACGGGGXGGGGXOUU^CAXCGCGICCTGGG^ 
■ • ; .^XGXAACACCIGCACGXCAXGAAatGCCCCACa " 
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2101 



— >--'-«*««^^««3C«;cCC»ACTACCa:GACICAS^ 

2161 ^«2SSTS^S§3gSS§TSgSeS^^ 
ATGXTCSCSAXATAfiTCMCCAcSSg^^ 

2221 «clSSSS£TCTClJlTC?g|S?S^ 

2341 J?«5SJ£g§;S5SJSg§SJMsg?s^ 



2461 



FIGURE 16-3 
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5951'- r^3lrVi^h'^ti2^^^^'>'^^^Sex1hXkUtd^^ 



•2881 




sirccAicoaaasiMctoTBsauMmMseo^^ 




3121 




;Aca 
ir 



336r 



3481 
3541 
3601 



T(WATCCOTCTCAAG«AACGCCXGCOGC0»CGMCCCCCCCCGA* 
ACACT6CTCAC(WT6AC(STSCCTA0SCiaAfi6tW»ACCCGtMCCSlCACACSAACTC 

GTICGTCtCTGACGCCCCMCTCTGACCttCW»AGC©aJGG0QGTeGG6J^ 

ValThzValProKlsProAsnIIe61u6]uVUAlaUuS«zttxThxGly(31uZlePto 
3721 GTCACTGTGCCCCATCCCAACAICGA6G«GITGCXClGTCaca^^ 

CACKACACGG(»TAGGGTT6TA6CrCC1CCftACGiU»CMGXQ6XG6CC7C^ 

PheTyrGlyLysAla lleProLeuGluValileLysClyGlyAwHliLauIlaPhnCys 
3781 ItTTACGGCAAGGCTATCCCCCTCGAAGTilATaVAGGeGGGaAaA^ 
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3841 
3901 
3961 




4141 



4441 
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4801 



AATaraiIGTA(KrCGTtCCCTJ«:rACCACCa;CTC«I«i^ 



CGGGAGCCG 



4921 



4981 



5041 



GAG(»CCTCTOK«M«CMKCGT<^ 

aswescMCMiAGCcaieia^xiMcccciiceAOMSS^ 

=.01 ^^^S^SSiigSSSIigiSSlMSSigggSSS 

CACGGCCTCTCGCTACBTCMCGGGCSCMrGACGGXATCAGTCGTCGGAGTOACATTGG 

5521 aWCTCCIGAGGCGACIGCACCAGTCGATAAGCTCGGAGTGTACCACICCATSCXCCGGT 
GICGAGGACTCCGCIGAOGTGGTCACCTATTCGAGCCTOlCAlXWTGAGGtACGAGGCCA 

c<!oi 5HHS55"^**P^^*''P**P''P^^«CysGluValLeuSarAspPheLy«ThrTrp 
" JS™HS!^*^'*=^*C^^^A^CGAGGT6TTGA6CGACmAAGACCTGG 
AGGACCGATTCCCXGtAGACCCXSACClATACSCTCaaaCTCGCTGAAACTCTSGA 

KeA^ J^5^n-i«fy«l*«*«e«*«^l«»t««ProGIyllePro'i^^^^ 
5641 etAAAAGCTAAGCTOlTGCCACACClWCTGGGATCCCCTTTGIfiTCCTCCCAGCGOTGG 
GATmCGArrCCAGTAaMTGICGACGGACCCrAGGGGAAACKAgSAOSSTCGCGCCC 

iyHtysGlyvaXTrpATSValAspGlylleMetHisThrArgCysHisCysGlyAlaSlu 
5701 TATAAGGGGGTCIWGaaMSTGGACGGaVTCATGCACACTCGCTGCCACTGTCGAGCTGA^ - 
ATATTCCCCCA6AC06CICACCIGGCGTAGTAGGTGTCAG|;GACGGIGAq^CT . 

IleThrGlyHisValLy9AsnGIyThr^ta 
5761 ATCACTGGACAT6TCAAAA^CGGGA0aI 



5161 



5221 



5261 



5341 
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lAGRyiCCTGXAaGtTimcCCtGCXACTCCiaCXaGCCKXavTa^ 



MSAAGCTAGGCGAACWCGCCKCTCMGCTCSOXTCTMM^ 

SACC<XTICAGAGCCTCIAAGCaWTCCGGGJW:QGGCRAA^C^ 

©WGGCGATCACCtClGCACCITXWGGGCT^ScMGM^^ 



£601 



«K:GAAGGTG5AGSTnCAGGG6ACGAqw:« 



GACTGACmCTlGGGAtAGATSAaSGWCCcSSiGS 
5^7S^'^<3SCATTACGGa6Aa^^ 
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6901 



6961 



;tgc 

;CAGG 




CTCTSA10jrAGXAOTCATTCXTGCTCC«AAASACGCWGTCSG*CTCTTCCeCCC« 

TTaK;TCGAGCASAGTA<K»CAiU»K»K:tAGiVCCCGCACGCGCACACCCTXTTCXACCGA 

/ 44 1 TTGIMGACGTGGTTACmGCTCCCCTTGGCCGTGATGGGA^^ 

AACATGCTGCACCAA1GTTXCGAGG66AACCGGCACTACCCTTCGAGGATGCCTAASGTT 

cn , Jy^SS^S^oGlyGlnAr^ValGluPher^uValGlnAlaTrpLysSerLysLysThrPro 
7501 TACtCACCAGGACAGCGWTTGAATTCCTCGTCCAACCGTGGAAGTCCAAGAAAAC^^ 
ATGAGTGGTCCTGTCGCCClUlCTTAAGGAGCACGTTCGCACCTTCACGTTCTTTre^^ 

-P , «2SGlyPheSBrTyrAtpThrArgCysPheA«pS«rThrValThr«luS«rAspIlftA^ 
7561 ATGGGGXTCTCGTATCATACCCGCTGCTTTGACTCCACAGTCACTGAGAGOGACATCCGT 
TACCCCAAGAGCATAC£ATGGGCGAC6AAACTGAGGTGTCAG1!<^ 

TiS51wGluAlalicTyiGinCFdCy«A5pLeuAapPiro61n^ 
7621 ACGGAGGAGGCAATCTACquVTGTTGTCACCTCGACCCCCAAGCCCGCGTGGCCATCAAG 
TGCCTCCTCCGmGATGGmCAACACTGGAGCTGGGGGTTCGGGCGCACCTC 

SerLeuThrGlutegteutyrValGlyGlyProLfeuThrAsnSer ArgGlyGIuAsn^^ 
7681 TCCCXCACCGAGAGGGTTTAIGTTGGGGGCCCTCTTACCAATTCAAGGGGM 
AGGGAGreGCtCTCeGWKTACWVCC^^ 

' " "'^^ 
7741 GWCTAtj:G,ca;GG|S5CG^^ 

iK^^^^v-'" ■ ' • ■ ' ■ 



■•'''^>^'f:SvSSl^'^:^-^^^*^^ 16-8 



EP 0 388 232 A1 



7861 



CCGATAGCMCCAC<»ax:GCTaaGCATGACTGTXATCS*C»CCRTT«aGGGA 

A 

ICGCTCT 



7991 



8101 



8161 



8221 



8281 
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-319 CACTCCACCATSAATCACTCCCCnaSl^^ 
GTGASGTCGTACTXAGMAGGGCymCICCia}^ 

-25 9 CCa3!GGCGTa!AGaiATCAGMTCGTSC3U^^ 
GG!IMCGC!^XCMACTCACAGCACG1»^^ 

-199 GOfGGXCTGCGGAACCGGTGAfiMCaCCJGGAATTGC^^ 
CACC&GACGCCTTGGCCACTCATGIGGCCmAC^ 

-139 !KaACCC6Ca:CaATGCCTGGAGATra(^^ 

AGTTGGGCGAGmCGGACC7Ca!AAA0Cm:AC6GGGGCG^3?^^ 

- 79 GMGGGTCGCGAAAGGCCTT62?GGTaCam35SAl!A6G6 

CAACCCAGCGCIOTCCGGAACACCftaGaOGGACTASDCCC^^ 

- 19 GOfCPCGMGACCGTOCACC 

C&SASCA!FCTGGCACG3!GG 

Arcf Thr 

MetSexTteAsnProLysProGlnLyfiiLysAflnLystegATO 

1 ATGAGCACGAATCCaaU^ACCTCAAAAftWUUUlC^ 
iaCTCGTGC!rTAGGA!DTa!GGAGTm5^^ 

AspValLysPheProGlyGlyGlyGOLb^j^^ 
01 GACGTCAAGTXCCCGGGaXSGCGGTOlGAiraj^ 
CTOCAGXXCAftGGGCCCACCGCCWSBCiafi^^ 

GlyProArgLeuGlyValArgAlaOtoAr^ISsXhrSerGlUArgSerG 
121 GGCCCTAGAXTGGGTOTOOGCGCGafflMRAIifiA^ 
CCGGGATCOJAACCCACMQCaCQCaJSCTCCa^^ 

ArgArgGlaProIleProLysAlaArgArgProGluGlyArgTlirTrpAlaGl^ 
IBl AGACGarAGCCmCCCCAAG6CT0Gl0eGC0C6AGGGCAGGAC(^ 
ax:aXK:AGTCGGAa!AG6GGTTCCGMGACCCGGGC3^ 

TyrProTrpProI.eu3?yrGlyAsnSLuSlyCysGlyiPrpAla61y^ 
241 2aCCCraGGCa;CTC!DATGGCAATOAQQGCTGCGM 

ATGGGAACCGGGGAGATACTGTTACTOaBAOGCCCACCO^^ 

ArgGlySerArgProSerTrpGlyPT^hrAspProArgArgAr 
301 CGTGGCTCTCGGCCaS^OXSGGGCmaCASICCXXZTO 

GCACCGA6AGCCGGATCGACCCCGG«3SKTCaX3GGG6CaKy^ 

liygVillleAspatoLeuThrCyeSLyPhoiaaAsprieuM^ 
361 AAGGTCATCGAOy^CCTTACGTGOQGCa^^ 

TXCCAGXAGCTAOJGGGAATCCACGCOSAAGCGGCTG^^^ 

GlyAlaProLeuGlyGlyAlaAlategAlaLeuAlaHisGlyValArg^ 
421 GGOGCCCCTCX3?6GAGGCGCTGCCaGGGOCClGGC^ 

CCGCGGGGAGAACCTOGCGACGGTOCXSGGGftCOGCGTA^^ 

Thr 

GlTValAsnXyrAlaThrGlyAstiLKiProayCysSejTPheserilePheLe 

GGCGSGAACTATGCAAGftESGGAAOCa!^^ 
CCGCACTTOATACGTTCTCXX:XTGSW^0GAOC2^A^ 

LeurieuSerCystauoairVilProMLaSerAlaT^ 
541 CTGCrcXCTOXKTTGACTOTOCCaSCITCGKr^ 
GACGAGAGAACQAACTGACACGGGOGAMCCS^^ 

TvrHtsvalThrAsiiAspCyfiProasiiSftrSerlleValTy^ 
601 TACCACGTCACCAATGATTGCCCraMaXX^^ 

T^iiHisThrProGlyCyaValProCyOTalftrgGluGlyAsnAlttSer^ 

fifii eS§actSg6XG0^ 

GSrSGOTCcScRMGft^^ 



481 



Pig^ 17-1 
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AlaMetffltcProThrValAlaatoArgtepGlyLysLeuPro^ 
CGCIiU:!DGG6GASGCCAa:G6I?GGICCC^ 

HisIlellspLfiuL&uVUGlySsrAIaThrLeuCysSerAlaLeu?^^ 
781 CACATC6A!I^Cfa?G7CGGGAGCGKX:A^ 

CysGlySerVUPheOCieuValGlsrGlxiLeuPha^ 
841 OXSCGGGKaSTCTMCTTOTOGGCCRA^ 

ThrGlnfilTCyfiAsnC^sSerXIeTTrProGlyHisIleThrGlyHisAxgHe^ 

901 ACQCiJiisa'miCM^^^ 

TGCGTTCC&i^GTTJ^GGAGAaiAGAXaG^ 

Val 

AapMetHetHiatl^nTxpSerPxo^ChTThrAl^ V. - 

961 GATATGAJDGATGMCaXMTCCCCa^CG2W:GGCGT^ . . ' : : '-r:^' " - 

CTAOJJUITACiaCTTCACCAjSGGGMGC^ 

PtoGlnAiall^uAspMetlle^ 
1021 CCa^laASCawrCITCGACAMA 

GGTGX£C6G3!AGAACCax;7ACTA6CGAGCACX;^ 

TyrPhsSerSletValGlyAsQTrpAlaLye^V^ilI^ 
1081 XAraTCa?CCATG6a!GGGGAACa?GGGCXU^AGG3K^ 
ATAAAGAGG!EACCA<XrC3!TGAGCCGCa^ 

v;alAspAlaGlu!ChrHisv«aoteGlyGlySemaGlyHi8ThrVa^ 
1141 GXCGACSGGSGAAACCCACGTaOCGGGGGAAGTCCCGGCCAOlCttGTOT^ Fig, 17-2 - 

CAGCTGCGCCTTOSGGTGCaGTGGCOCCCX!!^ 

SerLeuLeuAlaiE^oGlyAlaLyeGlxiAsnV^aaGlnZauIle^ 
1201 AGCCrcC^TCGCACCAGGCGCCAAGCAGAACGTCCAGraGATCAACAC^^ 
TCGGAGGAGCGTGGTCX:GCGGTICGXCriTOC^ 

HisLeuAsnSeriEhrAlaiieuAsnCyaAsnAspSerlieaiAsnThr^ 
1261 CACCTCAA!Z!AGCACGGCCCTGAACn!GCAAaX^^ 

GTGGAGmXCGKCOGGGACTTGAGGOfl^^ ^ 

LeuPheTyrHisHlsLysPheAsnSerSerGlyCysProGlu^ 

1321 CTTTTCTASCACCACAAGTOXIA^^ 

QAAAAGA3!AGax;GTG£XCAAGlfa!GAGAAGa:CCGAC^^ 

ProIaua3uA8pPheABp6lnGly^pGlyProXleSer!IYrAlaA6DGlySer61^ 
1381 CCCCa!!CACCGA7TTTGACCAGG6C!)^G6(^ 

GGG6AA3;GGC!CAAAA(»76GTCCCGACCC0GG6A!E^ 

AspGlnArgProXyrcyss^pHlc^ryxi^roPTOLys^^ 
1441 GACCAGCGCm:ZACaX3CTGGCAC3!ACX!CCCCAAAACCTa?^^ 
CTGGXOGCGGGGAXGACGACCGOXSA!!^ 

SerValC^sGli^roVaiayrCyaPheXhrProSerPrafV^lVUV^a^ 
1501 AGIGl^roGTCOGGXATAtOX^C^C^^ 

ArgSerGlyAlaProThrTyrSerTrpGlyGluAanAepThrA^^ . 
1561 AGGax:G6GGGCGCCCACCTACAGCIGGGG!CGAAAATGATACG^ ' ' .-^^^ ^ . 

AsnThrAargProPrdLeuGlyAsnO^PheGlyCyaThrTrpMet^ - ' T 

1621 AAa!ACCa6GCC»CC6CS!GGGCAAaMKKMC3G6I^ . - . * . 

ajTAOXSGTCCXSGTGGCGACOCGraAACCAAGra v ' 
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ThrLysVaLqysGlyi^aProProCFsVallleGlyGlyJaaGlvAsnAsnThrLBuHls 
1681 iu:auu^!]S!ix»:G62mK:cTCC!ri6asTC&aK^^ 

TGGTTTOOCGOCZCSGCGSftGGAACACAGTAfiCCaCCCCGCCCGTlG 

CysP2»TfarAfipGysPhed^L7sHlsPrQA8pAlaThxT?rSerArgC7sGl^^ 
1741 TGCCGCJUSea3TOCTICCGCAA6CATCa36ACGC^ ■ 

Lea 

PrtwaplMteProArgc^^BLeuvalAflpTyrProTyr^ 

1801 CCCr66iUBUVC&CCC»G6XGCCTG6IQ3JU:TACCa3Z&TA66Cm 
ThrneAeaTyrlEbrnePheLydlleArgHetTyzValGlyGiy^ 

1861 ACCAICMCEZUaCCAXATTTAAiUiaX^^ 

61uMaAIaC782^XrpShrArg6l761uArgC7fii^pLeu6luA8pA]^AapA^g . 
1921 GlAQCTGCnGCSAISTQGkJXGQQGO^ 

GluLettSgProL eu L eu L euT hrlhraagtaj^TrpGlnVal^ 
1981 6A6CIC1U3(!GCGXTACT6CI6ACCACTACACAGIIG(^ 

CTCGASaTOSGGCA&IGAOQACTGGTGATGTGTCAGCGTGCA^^ 

ThrlieaPzDftlBLeuSexTiurGlyLeuileKlsLeuHisGlnAsnllevai^ 
2041 ACCCIIACQGCCirTGrcCJUXKSGCCax:^^ 
(TGGGaiTGGICSGAACAGGTGGCCGGAiSTJ^^ 

TyrlJeuTyrGlyvalGlySerSerllaAlaSearTrpAlalleLysTrpGluTyrV 
2101 TACTTGT«(X»;6<SI(3(3(^TC3U^GCAICGCGTCC!rGGGCCAT!I^ pi a 17- 

AT6AACA!Emx:ACa:ClU3I7C6ZAGC6CAG^ 

LetiZ^aPliaieuLeuLeuAlaAfipAlaArgValCysSexCyaLeuTrplIetliet^ 
2161 CTCGTCTSaSTTCaXSCTOXSCAGACG^ 

GAGGAC2U^AAfiAG6AA06TCTGC6CGCGCA6ACGAGGAC6AACACCTACTACGAT^^ 

IleSerGlnAlaGluAlBAlaLeuGluAsnLeuValll^LeuAsnAlaAlaSerLeuAla 
2221 ATAT0CCAIlGG66AGGC66CZ!rTGGA6AACCTC6XAAXACITAAT(^^ 
lATAGGG!ITCGCCTCC6CCG%AACCTC»m36A6CAT!]!ASR;AAm 

GlyThrmseiyLeiAralStetPheLeuValPheffhecyaPheAlaTrpTj^r^u^ 
2281 GGGRSSaOBGSCTK'SIiXCCTTCCTOeTGT^ 

CCCZGCGOlECCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTACCAXAAACTTCCa^ 

LysTtpValProGlyAlaValTyrThrPheTyrGlyMetlrpProIieuleuLeiiLeuLi&u 
2341 AAGZGGGXmXiGAGCGGSCTACACCTTCTACGGGAa^IGGCClCTCCTCCTGCTCCTO 

LeuAlalfiiPzoGlnArgAlaTyrAlaX^uAapZhxGluValAIaAlaSexQrsGli^^ 
2401 TTG606TimX»lSa;GGC6TAC6C6CT6GACAC6GA66lGGCC»^GT^ 

AACCGCAACGSGGTOGCCOGCAO^CGCGACCIGTGCClCCACCGGCGCAGq^ ; . 

ValvalZieaValGlyLet^MiatAlaLeaThrlieuSerProTyrTiTLysAs^ 
2461 GTT6TTC3£GT06G6TT6A!IGGCGCTGACTC!rcTCACC^^ 

CAACAA6aSCAGCCCAACaSlCC6CGACTGAGACAGTG6TAXAAn$T3X:^^ ' 

' Asn '■ .■'l'[:.<''i'^W'j!v: ' V' 

TrpCvsLenTrpIriajeuSlnTvrPhaLettThrArtfVa^ ,. .. v 

2521 I(3GTGCTVTG6T6GCTIC&6TATTT!rcT6ACCAGAGi!G^iA6^ 
ACCAOGA2tfACCACC6AAGTCATAAAASACTGGICICACC^^ 

IleProPi*I«uAsnValArgGlyGlyArgAspAlaValtI^t4^lliM 
2581 ATTCOCCaCTCAACGTCOGAGGGGGGCGCGACGCCGTdtfXJT } • 

(TAAGGGGGSGAGTTGCaGGCTCCCCCCGCGCTGCGGCAGIASilMiP^ 
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2701 



2761 



2621 



2881 



2941 




TSGMcsseiiusTscMccccasKaaTOs^^ 

3001 

GsacAASoswGWMicccaMcccTcaKau^^ 

TTCCCCA£!CTCCaACGACCGC(MGTiU3ltK:CGCaTGCG(M« 




3181 ATTGaJCTCAACTGCTGCCCAidccrTCCTeGcJ^ Pig. 17-4 

^^^^sGlyAlaGlyThrArgThrIleAlaSerPrcSLvsGlvProv^lTl<»riwM*^ 
3241 STCTAC<»Q3GG(K;CGGAAC6AGGACCaTCGCGTCACcS^ 
Ci^TGGTOCCCCGGa:!ITCCT(XTKII!AGCG^ 

ATATGGmCATCTGGTTCTGGAACACPeeacpraa^ 



3361 
3421 



IfiTGGGACSTGAACGCCGASGAGCCTGQAAAIGGMCASI^^ 



ProValArgArgArgGlyAepSerArgGlySerLeuLeuSerProArtTPxoIleSerTvr 
GGGOVCGOGGCCGCCCCaCTATCGTCCCCGlMGaCGAa^^ 

l«uLysGlyS6rSeiGly61yProl^ulAueyePj?^la^M6^ 
3481 WGMAfiSCTC^^ 




3541 AGGGCCGCGGTGaX^CCCGTGGAGlfiGCT&AfiGGfiGi^^ 
TCCCG6CGCCaCACGTC6GCACCTC»C06ATia»C^^ 
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LeuGluThrThiMatiUrgSerProValPheThrAspAsnSerSerProProValValPro 
3601 CTAiSaGAaACCATGAMTCCCCGGTfiTTCACGSATAACTCCTCTC 
GATCTCreTTGGTACTCCa6GG©:CACAAGTGCCiaTTG^ 

GlnSerPheGlnValAlaHisLeuHlsAlaProThrGlySerGlyLysSerThrLysVal 
3661 C^GAGCTXCCAGGTGGCTCaCCTCCAOfiCTCCCAaUWCaGCGGC^^ 
6ICtCSAftSGTCC2«:CGAGTGGASG!ia03AG6GTGTCC6TCGCC»TTTT 

PraiaaAlaTyxJaaAlafilnGlyOYrLysValLeuValLeuAsnProSerValALaAla 
3721 CCGGCTGCATATCCAGCOXaeGGCTATMGGTGCiaWSTACTCAACCCCTCTC 

GGCCGACGTATAOSTCGASTCCCGAiaTTCCaumTCATGASTIGGGGAGAC^ 

Leu 

ThrLeuGlyPheGlyAlaTyiMetSerLTsAlaHifiGlyllaAspProAsnlleArgThr 
3781 ACACTGGGCTTTGGTGCmCATCTCCAAGGCKaTGGGATCGAICCTAACATCAGGAC 
TCTGACCaSAAACCACGAATQTACASGTTCCGAGiaCCCTAGCiaGCATTCIAST^^ 

GiyValArgThrlleThrlhrGlySerProileoairllyrSexThrTyrGlyLysPhete 
3841 GGGGTGAGftAOU^TOACCACWSGCAGCaCATCACGTACTCCACCTACGGCAAGTTCCT^ . . 
CCCCACTCTTGTIAATGGTCACCfiTCGGGGTASTGCATGAGGTGGATGCOGTIC^ ' . ' i- - 

AlaA8PGlyGlyCysSBrGiy6lyAlaa5rrAsplielleileC5rflA 
3901 GCCGACGSCGGGTGCTCGGGGGGCGCTTATGACATAATAATrTGTCACGAeTGCCACTCC 
CGGCTGCCGCCCACGAGCCCCCCGCGAAIACTCTATTACTAAACACTGCTCACSGT^^ 

ThrAspALaThrSerneLeuGlylleGlyOSuJVali^uaspGlnAlaGluJrhrAlaGly 
3961 ACGGATGCCACATCCATCTTGGGCATCGGaiCTGTCCTTGACCAAGCaGAGACTGOGGGG 
TGCCTACGGXGaaGGTAeAACCCGTAGCa3TGACAfi6AACTGGTTCGTCTCTGA(X!CCGC 

MzLftxgLeuValVall^uAlaThrAlaThrProProGlyServaiThiValProHlsPro 
4021 GCGAGACTGGTTCTGCTCGCCACCGCCACCCCTCCGGGCTCCGKaCTCTGCCCCATC^ 

CGCICTGACC&ACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGG ^^^^ j^^^g 

AsnIleGluGluV&lMal*uSerThr!EhrGlyGluIleProPheTyrGl3^ysAlalle 
4081 AACATCGAGGAGGTTGCTClGXCCACCftCCSGAeAfiATCCCXTTTTACGGCAAGGCTATC 
ITGTAGCTCCKCAACGAGaCAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGA'CAG 

ProLeuGluVallleLysGlyGlyArgHlsLeullePheCysHisSerLysLyflLyaCys 
4141 CCCCTCGAAGTAATOAGGGGGGGAGACATCICATCTTCTGrCATTCAAAGAAGAAG^ 
GGGGAGC1^T1AGTTC(XX:CCCICTGIAGAGTAGAAGAC»STAAGTTTCT 

AspGluLeuAiaAlaLysLeuValAlalAuGlyileAsnAlaValAlaTyrTyrArgGly 
4201 GACGAACTCGCCGCaAAGCTGGTCGCATTGGGCATCAAIGCCGTCGCCTACTACra 

CTGCTTGAGOSGCGTTTCGACCAGCSTAACCroTAGTTACGGCAGCGGaTaATGGC^^ 

LeuAspValSerVallleProThrSexGlyAspValValValValAlaThrAapAlaLeu 

4261 CTTGAKKTCCGTCaTCCCGACCAGCGGCGAlGTTGTCGTCGTCGCAA^ • 
GAACTGCACaGGC2W5TAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTSGCXACGGGAG 

Tyr 

MetThrGlyTyxThrGlyABpPheAspSerVallleAspCyaAanThrCysValThrGln 
4321 AIGACCGGCTATACCGGCGaCTTCSACa?CGGTGATAGACrGCAAIACGTGT6TCACCCAG 
TACTCGCCGATATGGCCGClGAAGCTGAGCCACTATCTGACGTIATGCACACAfilGGGTC 



Tlurva] 

A ft 1 AdJiGTC^ 

tgtSgctaamtcggaactgg^ 

AlaValSerArgtrhriSlntogArgGlyArgThrGlytegGiyLy^ 
4441 GCTGTCTCCCGCACTmCGTCGGG:(XaUKAGIGGCAQGG« 



4501 



PheValAlaPi?6eiyS£'iai|i . 
TTTGTGGCACCGGGGGAlSCGCCCCi 



erS^eryalLeuCysGluCys 
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TyrAspAlaGl;C7SAlaW77z€lulieuThrPXQAla61uShrTh3f7ttlAr9Le^ 
4561 TAT6AGGCAfi6C3S!iX3CTT66TAa!6AfiCZCA0GCGCGCCGftSa^ 
ATftCIG06TCCGACIlC^aACCATaCTCGil6Ta:G66CGGC!I!C3^ 

AlaT7tMetAsnThrProGl3^uProyalCysGInAapHls£Au61uPhe^^ 
4621 GCGTACAItGAftCMXCCGGGGCTTCCCGIGTGCOU^CC^ 
CeCATOTACTIIGIGGGGCCCCeAAGGGCAiCAOGGICCT^ 

ValPheZiu:6lSlieuThrHl8llei^pAlaHlaPh«LeuSer61nIhrLyfi61nSezGl7 
46S1 6!Ix:T!rTACAGGOCa<GAC!3X:ATAT2U3AS!GCCCACTl?^ 

CAGAAAieiCCG6A6TGA6TATATC3SkCm3TGAiU^XASG6^ 

GluAsnlAuProTyrlAuV^lMaTyxGlJiAlaThzValC^sAlaArgftlBGl^ 
4741 GAGAACCZO^CCTXACCIGGSAGCaSTACCAAGCCAGGGTGZSCGCTAGGGCI^ 
CICTI66AA86AA!rGGACCATC6CA!«3GTTCGG!im::ACA^^ 

ProProSerTrpAspGlnMetTrpLysCyaLeuIleArglieuLyaPro^ 
4801 CCCCCATGGfTGGGACCAGASGIGGAAGTGTTTGAIXCGCCrC^^ 

ProThrPrdLeuLeuTyrArgLeuGlyAlaVhlGlitfUnGluZleThxLettlhrHi^ 

GGTTGTGG66A0GAIAT6TCTGACCC6C6ACI2U^TCmGIII^ 

ValThrLysTyrlleNetxhrCysltetSerAlaAspLeuGluVUValStffSerTlairr^ 

4921 GTCACCAAATACATCAT6ACAT6CAT6TCG6CCGACCTG6AG6IC6TCA06A6CACCIG6 
CA6TGGTTTAT6!IA637ACT6IAC6TACAGCCGGCT66AC^^ 

ValLeuVal61y(>iyValLeuAlaAlaLeuAlaAlaTyxC7sLeuSexTl3Z<31yCyd\ral 
4981 GTGCTCGTTGGCGOOSTCCTGGCTGCTTTGGCC^CGTATIGC^ 

CACGA6CAACCGCC6CA6GACC6ACGAAACC6GC6CATAAC6GACAG!FTG!K^ 

ValllevalGlyAr^alValLeiiSerGlyLysProAlailellePxQAspArgGluVal ^ * 
5041 GICATAGTGGGaUSGGTCGTCTTGTCCGGGAAGCCGGCAAICAZIu:^!^^ 
CAGIATCACCCGTCCCAfiCAfiAAaU^CCCTZCGGCCSTT 

LeuTyrArgGluPheAspGluMdtGlUGluCysSexGlnHlsLeuPxofyrlleSlu^^^ 
5101 GTCTACC6A6AfiTTC6AT6AGAT6GAAGAGTG(3!CICAGC2U:T^ 

GA6AT6GCICTC2A6CTACTCTACCTTCTCACGA6J^T06IE6AAT6GCAT8!I^ 

61]^tMetLeuAlaGlu61nPheLyeGlnLysAlaLeuGlyi£i]Leu6In!n^ 
5161 GGGATGATGCIOGCGGAGCAGTTCA^GCAGAAGGCCCTOGGCGTCCZGCJUSACCGCGXCC 
CCCIACTACGA6CGGCZC6TCAAGZTCGTC!rTCCGGGAGCa36AG6ACX;ZC<^ 

ArgGlnAlaGluValllaAlaProAlaValGlnXhrAsnTrpGlilLyaLauGluThi^h^ 
5221 CGTCAG6CAGAG6TTATC6CCCCI6CTQTCCAGACCUUICT66CAAAAM:xCGA^ 
GCM3TCC6TCIC(»ATA6C6G66ACGACAGGTCIG6XTG2^06ZTTTI6A^^ 

TrpAlaLyaRisHetlrpAsnPhelleSexGlylleGlnTyzaiettAlaGlylAuSerXhr 
5281 TG6GCGAA6CmTGTGGAAC»TCATCA6T6G6A!!!M:AATACT!I6GG6GGCIIl6T^ 
ACCCGCI!n33Z&TACa;CCTIGAA6TAGTCACCCI^ 

LeuProGlyAsnFiroAlalleiAlBSsi^tiMetAlaPhftThrAla^ 
5341 CTGCCTGGTAACa:C6CGATTQCTTCATTSATG6CITT1S^^ 

6ACGGACCAXT66GGCGGIAACGAAGTAACTACCGA2^T61XSACX3ACft6!^ 

LeuThrTlu:SexGln:airLeu[LeuPha&snIltiLeu61y61y3^V 
5401 CTAACCACTAGCGil^CCCTCCTCTTCAACATATIGGGGGGGTSGG!^^ , 
GAT!K3G TGi| ]g|^^ |^(|3GASA AG JTGTATAA^ 

AlaMaPiE^Iyliil^'ail^i^^ 
5461 GCCGCCGGCSSWesiliiAiq^^ ' 
CGGCG6G^(^CpiC<^^^ 

Se2^al6i:3^|^^^^i|Iii^utleA8pIl^uAlaGlyT^ 'Vj. 
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5521 AGTGTTGGAClBGGGW^TCCTCATlT^au^ 

TC2«3ACCIGACCCCTTCCaG6a(3TATCTG!iaGGAM 

Gly 

GlyAlaLeuValAlaPheLysileMEtSerGlyGluValProSerThrGluAspLBUVal 

,581 GGaGCTCTEGTGGCa!iTCiUlGAT(a!IJG^ 

CCTX:GaGftACACCGIAAST!I!CTftfla!aCXCG(X&CTCCAGGGGAGGXGCCro^ 




2U\T(2aCTGCCCSCCftTCCTCTCec«X^ 

TXA6A!rGaiCGGGC6GZaGGft6iU3CGGGCCI06GG£GCAXCaGCCGC^ 

r i mrj»t a rwargHiflVaiaivProSlvGluGlYAlaValGlnTrpttetAsnArgLettlle 
5701 A!H«:irca3C03GC&C6TTeSCCCG6GC62USGGGGCAGTGCMT^ 
TiaSaCGCGGCCGTGOVftCOGGGCCaXSrcO^ 

AlaPheaOaSerArgGlyAsnHi^alSMProrhrHlsTyrValProGluSerAspAla 
5761 GCCTXCGCXITCCCGGGGGaaCCaTGTaarCCCECaCGCaCirACG 
CGGaAGCG6AG6GCCC0Ca!TGGTACM2^GGG!I!GCGTG&^^ 

HiaCys 



Alaaiaaa^valOJhrAlaiieieuserseriiSUTnrva^^^^ 
5821 6CT6CCCGC6TCACm3CXax&CTCaGCaGCCICACTCTAa(XC^ 
CGACMGGOGCafiaSGAaseaaaJGAGTCGiraSGaGTGACAT^ 

HisGlntolleSerSeiGluCysThiaaurProCyBSerGlySerTrpLeute 
5881 CACXaGTGGATAAGCT0SGfiGaX3T&CCACTCCATeCTCCSGTTCCT^ 
GTGG]!Caa:raiTTC6AfiOC3!CACRa«GmGGTW:GAGGCCaA66^ 

TrpAspOSrpIleCrsGliivalteuSferAspPheLysThrTrpLeuL 




6001 



CCa<MCTCCCTGGGA!H3CCCTTTGTeTCCXBCi;«ew 
GGTGTCGJ«BGACCC!CaGGGGAAACaCaGGa06G3!CGCGCa»TATTC^ 

Gly 




GTOGACGGCATC&TGCiyCflCTCSC 

OUXaGCCGTAGOaCGIGaSAGCBRCGGTGACACCTCGACTCT^ 
AanSlvTIuatetArgllevaiGlyProArgThrCyeArgJtonMrtTrpSerGlyT^ 




ProUeAsnRlaTyrThrBirGlyProCyaatoProLauPro^ 
6181 cccfta!aaaaxMCXACac(aesG6ca:cK 

GGGmmCGG&TGTGSJm:0GGGGaCMGGGGG6AflSGACGC^ 




6341 GCG(a!Aa!eSA6flGICS!I!<a!aafiAGGAA»AT^ 



6301 



TACGTCACGGGka!GftCI»CroACRATCraUU^ 
A^CToSai&CaBiaGACTOTTiySAGTTTAO^^ 

t^,...*.,*^'*.^;*^ ........ 

PhePhBOairGliflieuAapeLyValArglAUHisArgPheAlaProProCysLysPrtJI^u 
6361' TOTTCA^AASWSGaOKGGXeCGCGTAC&TAGGTITGCGCCC^ 




Pig. 17-7. 
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ProCysGluPrtsGluPro&spValiaaValLeuThrSerMeiaau'^^ 
6481 CCTTGCGAGCC0S2^CCGGA(KT66CCGTG!I^^ 

GGMCGCICGGGCXaX^GCCTGCACCGGCACAACTGClUSGl^ 

Ile!ItaAlaGlu2U.aAla61yAr^ArgLeuMaArgGlySe3S»roProSe^ 
541 ATi^CAGCAGAGGCGGCCGGGCGAAGGrrGGCGAGGGGATC^ 
TiWPTGTC»T(3?CC6CCGGa:CGC!D!ECG&ACC^ 

SerSerAlaSarGlxiLeuSerAlaProSerLeuLysAla^aii^yaT]^^ 

66 01 TCCTCGGCiaGaaGCTATCCGCTCCATCTCTC^ 

AGGASCCGATCGGSGGATAGGCGAGSTAGAGAGSICCGZS^^ 

SerProAspAlaGliiI^uIl^luAlaAsnLeuIieul^ 
6661 TCCCCTGA!l!&C3!6A6C!rCA!rA6AGGCC^ 

AGGGGAClACGACTCGAGaiATCTCCGGTWM^^ 

IleThrArtfVilGluSerGluAsnLysValVallleLeuAspSerPheAspPrcte 

67 21 ATCACCAGGGTTOAGTCAGAAAACAAAGTGGKSA!^ 

TAGTCGTCCCAACXCAGTCTTOTCO^^ 

AlaGluGiuAspGluArgGlulleflerValProAlaGluIlel^^ 
6781 GCGGAGGAG6A0Ga6CGG6A6ATCTCCGrACCCBGAfiA2iA«3^^ 
CGCCK:ClKXaJGCT»CCCax:TAGAGG»^ 

PheAlaGlnAlaLBuProVSaTrpAlaArgProAspTTTAsn^ 
6841 TTCGCCCAGGCCCTGCCCGTTTGGGCGCSGOCGGACOaTAAOCCa 
AAGaSGGaXXMGGACGGGCAAACCCGCGCCGGCCTGATAI!^^ 

TrpLysLysPiro2^pTyTGluProProValValHisGly(^sPrc3Le^ 
6901 TGGAAAAAfia:CGACTACGAAC(i^CCTQMGXCCaaBGCTX3a^ 
ACCTTTTTCGGGCTGATCCTKMTGGAGACCAGGTACCGAC^ 

SerProProValProProProArgliysLysArgThxValValteuTlirGluSer^^ 
6961 TCCCCrcaraTGGCTCCGCCTCGGAAGAAGCGGACGGTGGOXJCTC^^ 
AGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACXa^^ 

Ser 

SerThrAlarauAlaGluLBuAlaThrArgSerPheGlySerSerSerOI^ 
7021 TCTACKCCTTG6CCG AGCTCGCCACCAGAAGCTTTGGCAG^ 
AGATQACGG&ACGGGCTOGAGCGGIXSGXCSSCGAAACCGTC^^ 

rtoGlyAspAsnTlirThr!lteSerSerGl^ 
7081 ACGGGCGACAAIACGACAACAICCTCTGAGCCCGCCCCTTa^ 
TGCCCGCTGT!CATGCTG!ra»GTAGGafiACTCGGGC^^ 

PlisAla 

AspAlaGluSeriPyrSerSeadtotProProLeuSluGlyGlwProGly 
7141 GACGCTGAGTOCTATTCCTOaTGCCCCCCCTGGAQGGGGAfiC^ 
CTGOSACTCIffiGArAAeGflSGTACGGGGGGGACCTCCCCSia^^ 

SerAepGlySe3fSrpSerThrValSerSerGliiAlaAsrA^ 
7201 AGCGACGGOTCATGGTCAACGGTCKjaGTGaGGCC^ 

TCGCTGCCCABoiacausraGCCAGaa^^ 

, SerMBtSerTyrSeart?rp!Ehr6lyAlaL©u^ 
7261 TCAATGTCTTACTCTTGGACASGCGCACTCGT^ 
ASm 

^ LauProIleAsnAlalieuSerAsnSerlAtiLeuArgHiaHlsAenl^^ 
7321 CTGCCCAaJCAAOISCACTAAGCAACaEGTTOClACGT^ 

6A0GGGaGmCGTGATTCGTT6MCRACGAa?GCAST^ 

' • ThrSerArgSerAlaCyBGlnArgGlnLysLyaValThrPhe^ 
ia^AGGmeGCAGTOCTTGCCAAAGG^ 
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AspSerHi^^rGlnAspVadLeoLysGluValLysAlaAlaMaSerLysValLysAla 
7441 GACAGCCamCCAGGAOGOaClXaAGGaSSraAAAGCAG^ 
CTGTCGGmTGGTCCTGCamSAGTTCC^ 

Phe 

AsnLein:^uSerValGlu61uAlaCy^erLeuThrProProHldSerAlaLysS6rL^ 
7501 AACTTGCTATCCGTAGAGGAAGCTIGCaeCCTGACGCC^^ 
TT6AACGAmS6CA!rC3!a:T!IX^ 

PheGlyTyreiyAlaLysAapValArgCyBHisAlaArgLyaAlaValTte^ 
7 S61 TTTGGTOaaSGGGCAiAAGAaSTCrGTTGCCATC^ 

AAACCAATACCCOGOTTTCTGCAGGClAQGGTACGGTCm 

SerVaia!rpliy0AspLe\]LGu6luA6pAsnVala3irPr^^ 
7521 TCCGTCa?GGU^6ACCTTCIIKX!AAGAC2aaB 
AGGCACACQTTCTGGAASACCTTCaJG^ 

LysAsnGlumlPheCyeValGlriProGluLysGlyGlyArgLysProAlaArgl-e 
7581 AAGAACGAGSTTrTC!reCGTTCAGCC!I5GAGAAGGG^ 
CTCOTGCamAAAGACGCAAGTCSC^^ 

V&lPheProkspLeuGlyValArgV^ 
7741 GTGCTCCOGATCTGGGOGMCGCGTGOBCSAW^ 
CA<aAGGGGCTAGACCCGCAC!GC6CA<aOGCTTra^ 

LysLeuProteuAlaValMetGlySeaSerTyrfilyPheGlna^^ 
7 801 AAGCTCCCaTGGCCGTGAOSGGAaSCTCCTACGGACT^ 
TTCGAGGGGiACGGGCACTACCCTTCGaGXaTGCCT^ 

ValGluPheteuValGlnAlalTrpLysSerLysLysThrProMetGlyPheSer^^ 
7861 GTMAATTCCTCGTGCAAGCKTCGAAGTOCAASAAA^ 
CAACTOIAAGaGCACGTTCGavCCTXCAMTTCCT 

ThrArgCysPheAspSerThrValairGluSerAspIleArgajhrGluGl^ 
7921 ACCCGCraCf TTGACTCCACAGTCaCTOASAGCGACAir CCGTACGG2i^ 
TGGGCGAOSiAACTGAGGTGTCAGTGAC^ 

GlncysCvaispLoiiAspProGlnAlaArgVamalleLysSerLeuTh^ 
7981 CAATCTTGmCCTCGACCCCCAAGOmiGTGGCCATCAAGTC 

Gly 

TyrValGly«lyPr<^euaauAffnSeaargGlyGluAsnCy8Gly^ 

8041 ^EATGTTGSGrcCCCTCTTACCAAT^^ 

ATACAACCCCC6GGAfiAAnBGT!I!AaB!I?TCX:C^ 

AlaSerGlyTalLeuThrfflirSex<:5r6Sl^ 
8101 GCBAGCGGaTACTGACAACaaGCEeTGGTAACACCCT^ 
CGCXCGCOGCATGACTGTTGMCGACACCR^TOTO^ 

AlaAlaCysirgAlaAlaGlyLeuGlnAfipCysThrMetLeuValCysGlyAspi^ 

8161 GC^fiCCTOTCGAGCCGCaGGGCTOCaGGaCTGCAC 
CGTCGGACAJCICGGCGTCCCfiAfiGimGACGT^ 

ViaviLILeCysGluSerAlaGiy\ralSlnGluAepAi 
8221 GTC»TTATCtGTGAAAGC»0GGGGG!rcCflGGAGGACGC6GCGAG^ 
CAGCAAllAGlCACTiraCGOGCCGOCAG^ 

GluAlaMetthrArgavxSerAlaProPitjQlyAspProPrGGl^^ 
8281 GAGGCTAaSiCCAGGTACOOTGCCCOCCClif^ 

CTCCQAOT^tGGTCCATGMGCGGGGGGGACCCCTGGGGGGTGTTGGXCT^ 



8341 



GluLeuIlethrSerCysSerSerAsnV^aSerValAlaHisAspGlyAlaGlyL^^ 

6AGCTCATAICA7 
CTCGAGOlATfGm 



ValTyr3?yrl^u!nirJU:gA^ 
8401 GTCTACmvCCTCRCCC»TGMCCTACRAOCCCCCT 
C2^TG&TG6AG!lEGGCaCIEGGGan!GTa!^^ 

iUrgHisThrProV&lAmiSerTrpLeuGlyAsnlleile*^ 
8461 AGkCkCliCTCa^TCAM 

AlaArgHetlleiLeuMetZhrBlsPhePheSerTUL 
8521 GCGAGGATGATACTGAaXSaC^ 
C6CTCCTACTA!EG3VC^ 

61nAlalieuAspQi^fi61ulleTyr61yAlaC7sTy^ 
8581 CAGGCCCTCSAJRimSAGAaJCTACGGGGCCT^ 
GTCC6G6A6CZAa06C37C!I!AGAT6C(n:^ 

ProlleIleGlQArgl«iLis6l7l'euS»AlaPheSea^ 
8641 CCAAICAT!ItaAAGACTCCAaBGCCTCAGC3GC^ 
GGlTAGTAAGTTXCXfiAGGZACCGGAS 

GluIlftAenAr^vai!Q.aAlaC>?sLeuArgLysr«uGlyValProP - . 

8701 GAAATTlATAGGSEGGCCGCATfGCCKAGAA^ 

CTTTAATIAICCEACGGGCCTACGSaGTCTITI^^ 

Gly 

ArgMsArgAlaArgSerValArgAlaArglAUleuAlaArg^ 
8761 AGACACCGGGCCOSGAGOSircCGCGCTAjBGCCTCTGGCCaeA^^ 

TCTCTCGCCCGGGCCTCGCAGGCGCGATOCGAASaCCGGTCTC ^ 

CysGlyLysTyrLeuPheAsnTrpAlaValArgThrLysLe^ 
8821 IGTGGCAAGTACCICCaHrCAACZGSSCAGTAAGA^ 

ACACCG3!a?CATG6&GAAGa?!r6ACCC»TCATICa^ Yig 17-10 
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